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DESCRIPTION 
PROTEIN KINASES 

FIELD OF THE INVENTION 
5 The present invention relates to novel kinase polypeptides, nucleotide sequences 

encoding the novel kinase polypeptides, as well as various products and methods useful 
for the diagnosis and treatment of various kinase-related diseases and conditions. 

BACKGROUND OF THE INVENTION 
10 The following description of the background of the invention is provided to aid in 

understanding the invention, but is not admitted to be or to describe prior art to the 
invention. 

Cellular signal transduction is a fundamental mechanism whereby external stimuli 
that regulate diverse cellular processes are relayed to the interior of cells. One of the key 

15 biochemical mechanisms of signal transduction involves the reversible phosphorylation of 

proteins, which enables regulation of the activity of mature proteins by altering their 
structure and function. 

Protein phosphorylation plays a pivotal role in biological signal transduction. 
Among the biological functions controlled by protein phosphorylation are the following: 

20 cell division; differentiation and death (apoptosis); cell motility and cytoskeletal structure; 

control of DNA replication, transcription, splicing and translation; protein translocation 
events from the endoplasmic reticulum and Golgi apparatus to the membrane and 
extracellular space; protein nuclear import and export; regulation of metabolic reactions, 
etc. Abnormal protein phosphorylation is widely recognized to be causally linked to the 

25 etiology of many diseases including cancer as well as immunologic, neuronal and 

metabolic disorders. 

The most common phospho-acceptor amino acid residues are serine, threonine and 
tyrosine. Phosphorylation in histidine has also been observed in bacteria. The presence of 
a phosphate moeity modulates protein function in multiple ways. A common mechanism 
30 includes changes in the catalytic properties (V max and K m ) of an enzyme leading to its 
activation or inactivation. A second widely recognized mechanism involves promoting 
protein-protein interactions. An example of this is the tyrosine autophosphorylation of the 
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ligand-activated EGF receptor tyrosine kinase. This event triggers the high-affinity 
binding 10 the phosphotyrosine residue on the receptor's C-terminal intracellular domain 
to the SH2 motif of the adaptor molecule Grb2. Grb2 in turn binds through its SH3 motif 
to a second adaptor molecule, such as SHC. The formation of this ternary complex 
acivates the signaling events that are responsible for the biological effects of EGF. Serine 
and threonine phosphorylation events have also being recently recognized to exert their 
biological function through protein-protein interaction events mediated by the high- 
affinity binding of phosphoserine and phosphothreonine to WW motifs present in a large 
.variety of proteins (Lu, PJ. ei ai (1999) Science 283:1325-1328). A third important 
outcome of protein phosphorylation is changes in the subcellular localization of the 
substrate. As an example, nuclear import and export events in a large diversity of proteins 
are regulated by protein phosphorylation (Drier E.A. el ai (1999) Genes Dev 13: 556- 
568). 

Protein kinases are one of the largest families of eukaryotic proteins with several 
hundred known members. These proteins share a 250-300 amino acid domain that can be 
subdivided into 12 distinct subdomains that compose the common catalytic core structure. 
These conserved protein motifs have recently been exploited using PCR-based and 
bioinformatic strategies leading to a significant expansion of the known kinases. Multiple 
alignment of the sequences in the catalytic domain of protein kinases and subsequent 
parsimony analysis permits their segregation into a dendrogram reflecting the relatedness 
of their catalytic domains (Fig. 1). In this manner, related kinases are clustered into 
distinct branches or subfamilies including: tyrosine kinases, cyclic-nucleotide-dependent 
kinases, calcium/calmodulin kinases, cyclin-dependent kinases and MAP-kinases, serine- 
threonine kinase receptors, and several other less defined subfamilies. 

We have recently completed a systematic analysis of the protein kinases present in 
C elegans, the multicellular organism whose entire DNA sequence has been determined. 
We identified 473 unique kinase profiles including 398 full-length conventional kinases, 
and 20 additional proteins that may function as atypical protein kinases. (Plowman G.D. 
etai (1999), Proc. Natl. Acad. Sci. 96:13603-13610). 

Using parsimony analysis, the protein kinases may be divided into 4 major groups: 
AGC, CAMK, CMGC and tyrosine kinases. In addition, there are a number of minor yet 
distinct families, including the STE and casein kinase 1, families related to wonn- or 
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fungal-specific kinases, and a family designated "other" to represent several smaller 
families. In addition, we designate an "atypical" family to represent protein kinases whose 
catalytic domain has little or no primary sequence homology to conventional kinases, 
including the A6 kinases and P13 kinases. 

5 The AGC kinases are basic amino acid-directed enzymes that phosphorylate 

residues found proximal to Axg and Lys. Examples of this group are the cyclic nucleotide- 
dependent kinases, G protein kinases, NDR or DBF2 and the ribosomal S6 kinases. 

The CAMK group kinases are also basic amino acid-directed kinases. They include 
the Ca2+/calmodulin-regulated and AMP-dependent protein kinases, myosin light chain 

10 kinases, checkpoint 2 kinases (CHK2) and EMK-related protein kinases. The EMK family 
of STK are involved in the control of cell polarity, micotubule stability and cancer. One 
member of the EMK family, C-TAK1 has been reported to control entry into mitosis by 
activating Cdc25C which in tum dephosphorylates Cdc2. 

CMGC group kinases are "proline-directed" enzymes phosphorylating residues 

1 5 that exist in a proline-rich context. They include the cyclin-dependent kinases (CDKs), 

mitogen-activated kinases (MAPKs), GSK3s and CLKs. Most CMGC kinases have 
larger-than-average kinase domains owing to the presence of insertions within subdomains 
X and XI. 

The tyrosine kinase group encompass both cytoplasmic (i.e. src) as well as 
20 transmembrane receptor tyrosine kinases (i.e. EGF receptor). These kinases play a pivotal 

role in the signal transduction processes lhat mediate cell proliferation, differentiation and 
apoptotis. 

Group members that define smaller, yet distinct phylogenetic branches of 
conventional kinases include the elongation factor 2 kinases (EIFKs); homologues of the 

25 yeast sterile family kinases (STE) which refers to 3 classes of kinases which lie 

sequentially upstream of the MAPKs; mixed lineage kinases (MLKs); Lim-domain 
containing kinases (LIMKs); Calcium-calmodulin kinase kinases (CAMKK), dual-specific 
tyrosine kinases (DYRK), integrin receptor associated kinase (IRAK); lestis-specific 
kinases (TSK); UNC-51 related kinases (UNC); several families that are close 

30 homologues to worm (C26C2.1, YQ09, ZC581.9, YFL033c, C24A1.3), Drosophila 
(SLOB), or yeast (YDOD_sp, YGR262_sc) kinases, and others that are "unique" and 
don't cluster into any obvious family. 
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SUMMARY OF THE INVENTION 

Through a search of the EST database for homologies to the conserved catalytic 
kinase domain of protein kinases, hundreds of mammalian members of known and 
previously unidentified protein kinase families and groups have been identified as part of 
the present invention. Multiple alignment and parsimony analysis of the catalytic domain 
reveals that approximately half of these protein kinases cluster into 10 known groups, with 
the other half perhaps defining novel groups. Classification in this manner has proven 
highly accurate not only in predicting motifs present in the remaining non-catalytic portion 
of each protein, but also in their regulation, substrates, and signaling pathways. The 
present invention includes the partial or complete sequence of new protein kinases, their 
classification, predicted or deduced protein structure, and a strategy for elucidating their 
biologic and therapeutic relevance. 

Thus, a first aspect of the invention features an isolated, enriched, or punfied 
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting 



SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO 
SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO 
SEQ ID NO: 1 32, SEQ ID NO: 133, SEQ ID NO 
SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO 
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO 
SEQ ID NO: 147. SEQ ID NO: 1 48, SEQ ID NO 
SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO 
SEQ ID NO:l 57, SEQ ID NO: 1 58, SEQ ID NO 
SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO 
SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO 
SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO 
SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO 
SEQ ID NO:l 82, SEQ ID NO:l 83, SEQ ID NO 
SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO 
SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO 
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO 
SEQ ID NO:202, SEQ ED NO:203, SEQ ID NO 
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO 



124, SEQ ID NO:125, SEQ ID NO:126, 
129, SEQ ID NO: 130, SEQ ID NO:131, 
134, SEQ ID NO: 135, SEQ ID NO:136, 
139, SEQ ID NO: 140, SEQ ID NO:141, 
144, SEQ ID NO: 145, SEQ ID NO: 146, 
1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , 
154, SEQ ID NO: 155, SEQ ID NO: 156, 
1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , 
164, SEQIDNO:165. SEQIDN0:166, 
169, SEQ ID NO: 170, SEQ ID NO:171, 
174, SEQ ID NO: 175, SEQ ID NO: 176, 
1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , 
184, SEQ ID NO: 185, SEQ ID NO: 186, 
189, SEQ ID NO: 190, SEQ ID NO:191, 
194, SEQ ID NO: 195, SEQ ID NO: 196, 
199, SEQ ID NO:200, SEQ ID NO:201, 
204, SEQ ID NO:205, SEQ ID NO:206, 
209, SEQ ID NO:210, SEQ ID NO:21 1, 
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SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, 
SEQ ID N0:21 7, SEQ ID N0:2 1 8, SEQ ID N0:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , 
SEQ ID NO:222, SEQ ID NO:223 3 SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, 
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, 
5 SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234 f SEQ ID NO:235, SEQ ID NO:236, 

SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ ID NO:241, 
and SEQIDNO:242. 

By "isolated" in reference to nucleic acid is meant a polymer of nucleotides 
conjugated to each other, including DNA_and .RNA, that is ispjated from a natural source 

1 0 or that is synthesized. The isolated nucleic acid of the present invention is unique in the 

sense that it is not found in a pure or separated state in nature. Use of the term "isolated" 
indicates that a naturally occurring sequence has been removed from its normal cellular 
(i.e., chromosomal) environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 

15 the only nucleotide chain present, but that it is essentially free (about 90 - 95% pure at 

least) of non-nucleotide material naturally associated with it, and thus is distinguished 
from isolated chromosomes. 

By the use of the term "enriched" in reference to nucleic acid is meant that the 
specific DNA or RNA sequence constitutes a significantly higher fraction (2 - 5 fold) of 

20 the total DNA or RNA present in the cells or solution of interest than in normal or 

diseased cells or in the cells from which the sequence was taken. This could be caused by 
a person by preferential reduction in the amount of other DNA or RNA present, or by a 
preferential increase in the amount of the specific DNA or RNA sequence, or by a 
combination of the two. However, it should be noted that enriched does not imply that 

25 there are no other DNA or RNA sequences present, just that the relative amount of the 

sequence of interest has been significantly increased. The term "significant" is used to 
indicate that the level of increase is useful to the person making such an increase, and 
generally means an increase relative to other nucleic acids of about at least 2 fold, more 
preferably at least 5 to 10 fold or even more. The term also does not imply that there is no 

30 DNA or RNA from other sources. The other source DNA may, for example, comprise 

DNA from a yeast or bacterial genome, or a cloning vector such as pUCl 9. This term 
distinguishes from naturally occurring events, such as viral infection, or tumor type 
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growths, in which the level of one mRNA may be naturally increased relative to other 
species of mRNA. That is, the term is meant to cover only those situations in which a 
person has intervened to elevate the proportion of the desired nucleic acid. 

It is also advantageous for some purposes that a nucleotide sequence be in purified 
5 form. The term "purified" in reference to nucleic acid does not require absolute purity 

(such as a homogeneous preparation). Instead, it represents an indication that the 
sequence is relatively more pure than in the natural environment (compared to the natural 
level this level should be at least 2-5 fold greater, e.g., in terms of mg/mL). Individual 
clones isolated from a cDNA library may be purified to electrophoretic homogeneity. The 

10 claimed DNA molecules obtained from these clones could be obtained directly from total 

DNA or from total RNA. The cDNA clones are not naturally occurring, but rather are 
preferably obtained via manipulation of a partially purified naturally occurring substance 
(messenger RNA). The construction of a cDNA library from mRNA involves the creation 
of a synthetic substance (cDNA) and pure individual cDNA clones can be isolated from 

1 5 the synthetic library by clonal selection of the cells carrying the cDNA library. Thus, the 

process which includes the construction of a cDNA library from mRNA and isolation of 
distinct cDNA clones yields an approximately 10 6 -fold purification of the native message. 
Thus, purification of at least one order of magnitude, preferably two or three orders, and 
more preferably four or five orders of magnitude is expressly contemplated. 

20 By a "kinase polypeptide" is meant 10 (preferably 20, more preferably 40, most 

preferably 75) or more contiguous amino acids set forth in an amino acid sequence 
selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID NO: 123, 
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128 s 
SEQ ID NO:129, SEQ ID NO:130, SEQ EDNO:131, SEQ ID NO:132, SEQ ID NO:133, 

25 SEQ ID NO: 1 34, SEQ ID NO: 135, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, 

SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, 
SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, 
SEQ ID NO: 149, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO:l 53, 
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, 

30 SEQ ID NO: 1 59, SEQ ED NO:l 60, SEQ ID NO:l 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, 
SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, 
SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ K) NO: 1 73, 
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SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, 
SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, 
SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, 
5 SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, 

SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, 
SEQ ID NO:204, SEQ ID NO.205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, 
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, 
SEQIDNO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, 

10 SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, 
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, 
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or functional 

15 derivatives thereof as described herein. For sequences for which the full-length sequence 

is not given, the remaining sequences can be determined using methods well-known to 
those in the art and are intended to be included in the invention. In certain aspects, 
polypeptides of 1 00, 200, 300 or more amino acids are preferred. The kinase polypeptide 
can be encoded by a full-length nucleic acid sequence or any portion of the full-length 

20 nucleic acid sequence, so long as a functional activity of the polypeptide is retained. By 
"functional" domain is meant any region of the polypeptide that may play a regulatory or 
catalytic role as predicted from amino acid sequence homology to other proteins or by the 
presence of amino acid sequences that may give rise to specific structural conformations 
(i.e., coiled-coils). For some purposes, polypeptide domains are preferred, including, but 

25 not limited to, N-terminal, catalytic/kinase and C-terminal. 

The amino acid sequence will be substantially similar to a sequence selected from 
the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID 
NO:124, SEQ IDNO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID 
NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID 

30 NO:l 34, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID 

NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID 
NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO : 1 48, SEQ ID 
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NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID 
NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID 
NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID 
NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID 

5 NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ED NO: 1 73, SEQ ID 
NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID 
NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID 
NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID 
-NOM89, SEQ ED NO: 190, SEQ ID NO: 191, SEQ ED NO: 199, SEQ ID NO: 193, SEQ DD 

1 0 NO: 1 94, SEQ ID NO: 1 95, SEQ ED NO: 1 96, SEQ ED NO: 1 97, SEQ ID NO: 1 98, SEQ ED 
NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID 
NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID 
NO:214, SEQ ED NO:215, SEQ ID N0:216, SEQ ID NO:217, SEQ ED NO:218, SEQ ID 

15 NO:219, SEQ ED NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID 

NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID 
NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID N0.232, SEQ ID NO:233, SEQ ID 
NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ED NO:240, SEQ ED NO:241, and SEQ ID NO:242, or the corresponding 

20 full-length amino acid sequence, or fragments thereof. A sequence that is substantially 
similar to a sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ED NO:123, SEQ ID NO:124, SEQ ED NO:125, SEQ ID NO:126, SEQ ED 
NO:127, SEQ ED NO:128, SEQ ED NO:129, SEQ ED NO:130, SEQ ID NO:131, SEQ ID 
NO: 1 32, SEQ ED NO: 1 33, SEQ ED NO: 1 34, SEQ ID NO: 135, SEQ ID NO: 1 36, SEQ ID 

25 NO:137, SEQ ID NO:138, SEQ ED NO:139, SEQ ID NO:140, SEQ ID NO: 141, SEQ ID 
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ED NO:145, SEQ ID NO:146, SEQ ID 
NO: 147, SEQ ED NO: 148, SEQ ED NO: 149, SEQ ED NO: 150, SEQ ID NO:l 51, SEQ ID 
NO: 152, SEQ ED NO: 153, SEQ ED NO: 154, SEQ ED NO:155, SEQ ED NO: 156, SEQ ID 
NO: 1 57, SEQ ED NO: 1 58, SEQ ED NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 

30 NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165. SEQ ID NO:166, SEQ ID 
NO: 167, SEQ ED NO: 168, SEQ ED NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID 
NO: 1 72, SEQ ED NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
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NO: 1 77, 


SEQ 


ID NO: 178, 


SEQ 


ID 


NO: 179, 


SEQ 


ID NO: 180, 


SEQ 


ID 


NO:181, 


SEQ 


ID 


NO: 182, 


SEQ 


ID NO: 183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID NO: 185, 


SEQ 


ID 


NO: 186, 


SEQ 


ID 


NO: 187, 


SEQ 


ID NO: 188, 


SEQ 


ID 


NO: 189, 


SEQ 


ID NO: 190, 


SEQ 


ID 


N0:191, 


SEQ 


ID 


NO: 1 99, 


SEQ 


IDNO:193, 


SEQ 


ID 


NO: 194, 


SEQ 


IDNO:195, 


SEQ 


ID 


NO:196, 


SEQ 


ID 


NO: 1 97, 


SEQ 


IDNO:198, 


SEQ 


ID 


NO: 199, 


SEQ 


ID NO:200, 


SEQ 


ID 


NO:201, 


SEQ 


ID 


NO:202, 


SEQ 


IDNO:203, 


SEQ 


ID 


NO:204, 


SEQ 


ID NO:205, 


SEQ 


ID 


NO.206, 


SEQ 


ID 


NO:207, 


SEQ 


ID NO:208, 


SEQ 


ID 


NO:209, 


SEQ 


IDNO:210, 


SEQ 


ID 


NO:211, 


SEQ 


ID 


NO:212, 


SEQ 


IDNO:213, 


SEQ 


ID 


NO:214, 


SEQ 


IDNO:215, 


SEQ 


ID 


NO:216, 


SEQ 


ID 


NO:217, 


SEQ 


BDNO:218, 


SEQ 


ID 


NO:219, 


SEQ 


ID NO:220, 


SEQ 


ID 


NO:221, 


SEQ 


ID 


NO:222, 


SEQ 


ID NO:223, 


SEQ 


ID 


NO:224, 


SEQ 


IDNO:225, 


SEQ 


ID 


NO:226, 


SEQ 


ID 


NO:227, 


SEQ 


IDNO:228, 


SEQ 


ID 


NO:229, 


SEQ 


ID NO:230, 


SEQ 


ID 


NO:231, 


SEQ 


ID 


NO:232, 


SEQ 


IDNO:233, 


SEQ 


ID 


NO:234, 


SEQ 


IDNO:235, 


SEQ 


ID 


NO:236, 


SEQ 


ID 


NO:237, 


SEQ 


1DN0:238, 


SEQ 


ID 


NO:239, 


SEQ 


ID NO:240, 


SEQ 


ID 


NO:241, 


and SEQ 



ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95% 

1 5 and most preferably 99-1 00%) to a sequence selected from the group consisting of those 

set forth in SEQ ID NO: 1 22, SEQ ID NO: 123, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ 
ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ 
ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ 
ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ 

20 ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ 

ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ 
ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO: 155, SEQ 
ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ 
ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ 

25 ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ 
ID NO:171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ 
ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ 
ID NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ 
ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ 

30 ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ 
ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ 
ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ 
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ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ 
ID N0:2 1 1 , SEQ ID NO:2 1 2, SEQ ID N0:2 1 3, SEQ ID N0:2 1 4, SEQ ID N0:2 15, SEQ 
ID N0:2 1 6, SEQ ID N0:2 1 7, SEQ ID N0:2 1 8, SEQ ID N0:2 1 9, SEQ ID NO:220, SEQ 
ID NO:22 1 , SEQ ED NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
5 ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 
ID NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ ED N0.234, SEQ ID NO:235, SEQ 
ED NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:24 1 , and SEQ ID NO:242 or portions of or the entire corresponding full-length 
amino acid sequences. 

10 By "identity" is meant a property of sequences that measures their similarity or 

relationship. Identity is measured by dividing the number of identical residues between 
two sequences (either full-length or a defined domain) by the total number of residues in 
the known sequence, or the domain of the known sequence, and multiplying the product 
by 100. Thus, two copies of exactly the same sequence have 100% identity, but sequences 

15 that are less highly conserved, and have replacements and substitutions, have a lower 

degree of identity. "Gaps" are spaces in an alignment that can result from aligning a novel 
sequence with a known sequence when the novel sequence has additions or deletions of 
amino acids in comparison with the known sequence. These gaps do not factor into the 
assessment of % identity using the sbove calculation. 

20 Those skilled in the art will recognize that several computer programs are also 

available for determining sequence identity using standard parameters, for example, Blast 
(Altschul, et al (1997) Nucleic Acids Res. 25:3389-3402), Blast2 (Altschul, et al (1990) 
J.Mol. Biol. 215:403-410), and Smith-Waterman (Smith, et al. (1981) J. Mol. Biol. 
147:195-197). 

25 In preferred embodiments, the invention features isolated, enriched, or purified 

nucleic acid molecules encoding a kinase polypeptide comprising a nucleotide sequence 
that: (a) encodes a polypeptide having an amino acid sequence selected from the group 
consisting of those set forth in SEQ ED NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ 
ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ 

30 ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ 
ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ 
ID NO: 1 40, SEQ ED NO: 1 4 1 , SEQ ID NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ 
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ID NO: 145, SEQ ED NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ 
ID NO: 150, SEQ ED NO: 151, SEQ ED NO: 152, SEQ ED NO: 153, SEQ ED NO: 154, SEQ 
ID NO:155, SEQ ED NO:156, SEQ ED NO:157, SEQ ED NO:158, SEQ ID NO:159, SEQ 
ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163. SEQ ID NO: 164, SEQ 
5 ID N0.165. SEQ ID NO:166, SEQ ID NO:167, SEQ ED NO:168, SEQ ID NO:169, SEQ 

ED NO:170, SEQ ED NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ 
ID NO: 175, SEQ ED NO: 176, SEQ ED NO: 177, SEQ ED NO: 178, SEQ ED NO: 179, SEQ 
ED NO: 180, SEQ ID NO: 181, SEQ ED NO: 182, SEQ ED NO: 183, SEQ ID NO: 184, SEQ 
ID NO: 185, SEQ ED NO: 186, SEQ ED NO: 187, SEQ ID NO: 188, SEQ ED NO: 189, SEQ 

10 ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ 

ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ 
ED NO:200, SEQ ID NO:201, SEQ ID NO.202, SEQ ED NO:203, SEQ ID NO:204, SEQ 
ID NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ID NO.209, SEQ 
ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ED NO:2 1 2, SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, SEQ 

15 ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ED NO:219, SEQ 

ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ 
ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ 
ED NO:230, SEQ ED NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ 
ED NO:235, SEQ ED NO:236, SEQ ID N0.237, SEQ ED NO:238, SEQ ED NO:239, SEQ 

20 ED NO:240, SEQ ED NO:241 , and SEQ ED NO:242, or the corresponding full-length 

amino acid sequence, or fragments thereof. A sequence that is substantially similar to a 
sequence selected from the group consisting of those set forth in SEQ ED NO: 122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ ED NO:125, SEQ ID NO:126, SEQ ED NO:127, SEQ ID 
NO: 1 28, SEQ ED NO: 1 29, SEQ ED NO : 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID 

25 NO:133, SEQ ED NO: 134, SEQ ED NO: 135, SEQ ED NO:136, SEQ ID NO:137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ED NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ED NO:145, SEQ ED NO:146, SEQ ID NO:147, SEQ ED 
NO: 1 48, SEQ ID NO: 1 49, SEQ ED NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 

30 NO: 1 58, SEQ ID NO: 1 59, SEQ ED NO: 1 60, SEQ ED NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID 
NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ED NO: 166, SEQ ID NO: 167, SEQ ID 
NO: 168, SEQ ID NO: 169, SEQ ED NO: 170, SEQ ED NO: 171, SEQ ED NO: 172, SEQ ID 
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NO: 173, 


SEQ 


ID NO: 174, 


SEQ 


ED 


NO: 175, 


SEQ 


ED 


NO: 176, 


SEQ 


ID NO: 177, SEQ ID 


NO: 178, 


SEQ 


ID NO: 179, 


SEQ 


ED 


NO: 180, 


SEQ 


ED 


NO:181, 


SEQ 


ED NO: 182, SEQ ED 


NO: 183, 


SEQ 


ID NO: 184, 


SEQ 


ED 


NO: 185, 


SEQ 


ID 


NO: 186, 


SEQ 


ID NO: 187, SEQ ID 


NO: 188, 


SEQ 


ID NO: 189, 


SEQ 


ED 


NO: 190, 


SEQ 


ID 


NO:191, 


SEQ 


ID NO: 199, SEQ ID 


NO: 193, 


SEQ 


ED NO: 194, 


SEQ 


ID 


NO: 195, 


SEQ 


ID 


NO: 196, 


SEQ 


ID NO: 197, SEQ ID 


NO: 198, 


SEQ 


ID NO: 199, 


SEQ 


ED 


NO:200, 


SEQ 


ED 


NO:201, 


SEQ 


ID NO:202, SEQ ID 


NO:203, 


SEQ 


ID NO:204, 


SEQ 


ID 


NO:205, 


SEQ 


ID 


NO:206, 


SEQ 


ID NO:207, SEQ ID 


NO:208, 


SEQ 


ID NO:209, 


SEQ 


ED 


NO:210, 


SEQ 


ID 


NO:211, 


SEQ 


IDNO:212,SEQ ID 


NO:213, 


SEQ 


ID NO:214, 


SEQ 


ED 


NO:215, 


SEQ 


ED 


NO:216, 


SEQ 


IDNO:217,SEQ ID 


NO:218, 


SEQ 


EDNO:219, 


SEQ 


ED 


NO:220, 


SEQ 


ED 


NO:221, 


SEQ 


ED NO:222, SEQ ID 


NO:223, 


SEQ 


ID NO:224, 


SEQ 


ED 


NO:225, 


SEQ 


ED 


NO:226, 


SEQ 


ED NO:227, SEQ ID 


NO:228, 


SEQ 


ED NO:229, 


SEQ 


ED 


NO:230, 


SEQ 


ED 


NO:231, 


SEQ 


ED NO:232, SEQ ID 


NO:233, 


SEQ 


ED NO:234, 


SEQ 


ED 


NO:235, 


SEQ 


ED 


NO:236, 


SEQ 


ED NO:237, SEQ ED 


NO:238, 


SEQ 


ED NO:239, 


SEQ 


ED 


NO:240, 


SEQ 


ED 


NO:241, 


and SEQ ID NO:242 will 



1 5 have at least 75% identity (preferably 90%, more preferably at least 95% and most 



preferably 99-100%) to the sequence selected from the group consisting of those set forth 
in SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID 





NO: 126, 


SEQ ID NO: 127, 


SEQ ID NO: 128, 


SEQ 


ID 


NO: 129, 


SEQ 


ID 


NO: 130, 


SEQ 


ID 




NO:131, 


SEQ ED NO: 132, 


SEQ ID NO: 133, 


SEQ 


ED 


NO: 134, 


SEQ 


ID 


NO:135, 


SEQ 


ID 


20 


NO:136, 


SEQ ID NO: 137, 


SEQ ED NO: 138, 


SEQ 


ID 


NO: 139, 


SEQ 


ID 


NO: 140, 


SEQ 


ID 




NO:141, 


SEQ ID NO: 142, 


SEQ ID NO: 143, 


SEQ 


ID 


NO: 144, 


SEQ 


ID 


NO:145, 


SEQ 


ID 




NO: 146, 


SEQ ED NO: 147, 


SEQ ED NO: 148, 


SEQ 


ED 


NO: 149, 


SEQ 


ID 


NO:150, 


SEQ 


ID 




NO:151, 


SEQ ED NO: 152, 


SEQ ED NO: 153, 


SEQ 


ED 


NO: 154, 


SEQ 


ID 


NO:155, 


SEQ 


ID 




NO: 156, 


SEQ ED NO: 157, 


SEQ ED NO: 15 8, 


SEQ 


ED 


NO: 159, 


SEQ 


ID 


NO: 160, 


SEQ 


ID 


25 


NO: 161, 


SEQ ED NO: 162, 


SEQ ED NO: 163, 


SEQ 


ID 


NO: 1 64, 


SEQ 


ID 


NO: 165. 


SEQ 


ID 




NO: 166, 


SEQ ED NO: 167, 


SEQ ED NO: 168, 


SEQ 


ED 


NO: 169, 


SEQ 


ED 


NO: 170, 


SEQ 


ID 




NO:171, 


SEQ ED NO: 172, 


SEQ ED NO: 173, 


SEQ 


ED 


NO: 174, 


SEQ 


ID 


NO: 175, 


SEQ 


ID 




NO:176, 


SEQ ED NO: 177, 


SEQ ED NO: 178, 


SEQ 


ED 


NO: 179, 


SEQ 


ED 


NO: 180, 


SEQ 


ID 




NO:181, 


SEQ ED NO: 182, 


SEQ ID NO: 183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO: 185, 


SEQ 


ID 


30 


NO: 186, 


SEQ ID NO: 187, 


SEQ ED NO: 188, 


SEQ 


ID 


NO: 189, 


SEQ 


ID 


NO: 190, 


SEQ 


ID 




NO:191, 


SEQ ED NO: 199, 


SEQ ID NO: 193, 


SEQ 


ED 


NO: 194, 


SEQ 


ID 


NO:195, 


SEQ 


ID 




NO: 196, 


SEQ ID NO: 197, 


SEQ ID NO: 198, 


SEQ 


ED 


NO: 1 99, 


SEQ 


ID 


NO:200, 


SEQ 


ID 
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NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID 
N0:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID 
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
5 NO:22 1 , SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ED 

NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID 
NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID 
NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ID NO:242; (b) is the complement of the nucleotide sequence of (a); (c) 

1 0 hybridizes under highly stringent conditions to the nucleotide molecule of (a) and encodes 

a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having an 
amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID 
NO:127, SEQ ED NO:128, SEQ ID NO:129, SEQ ED NO:130, SEQ ID NO:131, SEQ ID 

15 NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID 

NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 41 , SEQ ID 
NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID 
NO: 147, SEQ ID NO: 148, SEQ ED NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID 
NO: 1 52, SEQ ID NO: 1 53, SEQ ED NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 

20 NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID 
NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID 
NO: 167, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 71 , SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 

25 NO:182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

30 NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ IDNO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ DDNO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
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NO:222, SEQ ID NO:223, SEQ ED NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
5 ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 

A sequence that is substantially similar to a sequence selected from the group consisting of 
those set forth in SEQ ID NO: 122, SEQ ED NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, 
SEQIDNO:126, SEQIDNO:127, SEQIDNO:128, SEQE)NO:129, SEQIDNO:130, 
SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, 

10 SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, 
SEQ ED NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, 
SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, 
SEQ ED NO: 151, SEQ ED NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:1.55, 
SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, 

15 SEQ ED NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. 

SEQ ED NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, 
SEQ ID NO: 1 71 , SEQ ID NO: 172, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, 
SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, 
SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, 

20 SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, 

SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ED NO: 1 95, 
SEQ ID NO: 196, SEQ ID NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID NO:200, 
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, 
SEQ ED NO:206, SEQ DD NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ID NO:210, 

25 SEQ ED NO:21 1, SEQ ED NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ID NO:215, 

SEQ ED NO:2 1 6, SEQ DD NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, 
SEQ ED NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, 
SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, 
SEQ ED NO:231, SEQ DD NO:232, SEQ ED NO:233, SEQ DD NO:234, SEQ DD NO:235, 

30 SEQ ED NO:236, SEQ ID NO:237, SEQ DD NO:238, SEQ ID NO:239, SEQ ID NO:240, 
SEQ DD NO:241, and SEQ ED NO:242 will have at least 75% identity (preferably 90%, 
more preferably at least 95% and most preferably 99-100%) to the sequence of SEQ ID 
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NO: 122, SEQ ID NO: 123, SEQ ID NO:124, SEQ ED NO: 125, SEQ ID NO: 126, SEQ ID 
NO:127, SEQ ID NO:128, SEQ ED NO:129, SEQ ED NO:130 5 SEQ ID NO:131, SEQ ED 
NO: 1 32, SEQ ED NO: 1 33, SEQ ED NO: 1 34, SEQ ED NO: 1 35, SEQ ID NO: 1 36, SEQ ID 
NO: 137, SEQ ED NO: 138, SEQ ED NO: 139, SEQ ID NO: 140, SEQ ED NO: 141, SEQ ID 
5 NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 

NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID N0.151, SEQ ID 
NO:l 52, SEQ ID NO:l 53, SEQ ID NO:l 54, SEQ ID NO:l 55, SEQ ID NO:l 56, SEQ ID 
NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ED NO: 161, SEQ ID 
NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 

10 NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 172, SEQ ED NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ID NO: 176, SEQ ID 
NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ED 
NO: 182, SEQ ED NO: 183, SEQ ED NO: 184, SEQ ED NO: 185, SEQ ED NO: 186, SEQ ID 
NO: 187, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO : 1 90, SEQ ED NO: 1 9 1 , SEQ ID 

15 NO: 1 99, SEQ ED NO: 1 93, SEQ ED NO: 1 94, SEQ ED NO: 1 95, SEQ ID NO: 1 96, SEQ ID 

NO:197, SEQ ED NO:198 ( SEQ ED NO:199, SEQ ED NO:200, SEQ ED NO:201, SEQ ID 
NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID 

20 NO:2 1 7, SEQ ED NO:2 1 8, SEQ ED NO:2 1 9, SEQ ED NO:220, SEQ ID NO:22 1 , SEQ ID 

NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 

25 ED NO:242, except that it lacks one or more, but not all, of a domain selected from the 

group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a 
coiled-coil structure region, a proline-rich region, a spacer region, an insert, and a C- 
terminal tail; (e) is the complement of the nucleotide sequence of (d); (f) encodes a 
polypeptide having an amino acid sequence selected from the group consisting of those set 

30 forth in SEQ ID NO: 122, SEQ ED NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID 
NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 128, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID 
NO: 1 3 1 , SEQ ED NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID 
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NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: ] 38, SEQ ID NO: 139, SEQ ID NO: 1 40, SEQ ID 
NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ED NO: 145, SEQ ID 
NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID 
NO:151, SEQ ID NO: 152, SEQ ID NOT 53, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID 
NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID 
NO:161, SEQ ID NO:162, SEQ ID NO.T63, SEQ ID NO:164, SEQ ID NO:165. SEQ ID 
NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID 
NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID 
NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID 
NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID 
NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID 
NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID 
NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID 
NO:21 1, SEQ ED NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID 
NO:2 1 6, SEQ ID NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID 
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID N0.229, SEQ ID NO.230, SEQ ID 
NO:231, SEQ ID NO:232, SEQ ED NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID 
NO:236, SEQ ID NO:237, SEQ ID N0.238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ED NO:242, or the corresponding full-length amino acid sequence, or 
fragments thereof. (The domain demarcations of the polypeptides of the invention are 
indicated in Table 2 by reference to the kinase domain.) A sequence that is substantially 
similar to a sequence selected from the group consisting of those set forth in SEQ ID 
NO: 122, SEQ ID NO: 123, SEQ ED NO: 124, SEQ ED NO: 125, SEQ ID NO: 126, SEQ ID 
NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 131, SEQ ID 
NO: 132, SEQ ED NOT 33, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID 
NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ED 
NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ED NO: 150, SEQ ED NOT 51, SEQ ID 
NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ED NO: 156, SEQ ID 
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NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ IDNO:161, SEQ ID 
NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO:171, SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO:176, SEQ ID 
5 NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID N0.185, SEQ ID NO:186, SEQ ID 
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 

1 0 NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ED 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 

1 5 NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95% 
and most preferably 99-100%) to the sequence selected from the group consisting of those 

20 set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ 

ID NO: 126, SEQ ED NO: 127, SEQ ED NO: 128, SEQ ED NO: 129, SEQ ID NO: 130, SEQ 
ID NO: 131, SEQ ED NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ 
ED NO: 136, SEQ ED NO: 137, SEQ ID NO: 138, SEQ ED NO: 139, SEQ ED NO: 140, SEQ 
ED NO: 141, SEQ ED NO: 142, SEQ ID NO: 143, SEQ ED NO: 144, SEQ ID NO: 145, SEQ 

25 ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ 

ID NO: 151, SEQ ID NO: 152, SEQ ED NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ 
ED NO: 1 56, SEQ ED NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 159, SEQ ID NO: 1 60, SEQ 
ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ 
ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ 

30 ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ED NO: 1 74, SEQ ID NO: 1 75, SEQ 
ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ 
ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ 



WO 00/73469 



18 



PCT/USOO/14842 



ID NO: 186, SEQ ID NO: 187, SEQ DD NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ 
ID N0:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ 
ID NO:196, SEQ ID NO:197, SEQ ID N0.198, SEQ ID NO:199, SEQ ID NO:200, SEQ 
ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ 

5 ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ 

ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ 
ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ 
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
ID NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 

1 0 ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
ID NO:236, SEQ tD NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ 
ED NO:241, and SEQ ED NO:242; (b) is the complement of the nucleotide sequence of (a); 
(c) hybridizes under highly stringent conditions to the nucleotide molecule of (a) and 
encodes a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having 

1 5 an amino acid sequence selected from the group consisting of those set forth in SEQ ID 
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NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID N0:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID N0.214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQE>NO:218, SEQEDNO:219, SEQIDNO:220, SEQ IDNO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
N0.227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 
A sequence that is substantially similar to a sequence selected from the group consisting of 
those set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, 
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, 
SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, 
SEQ ID NO:136, SEQ ID NO:137, SEQ ED NO:138, SEQ ID NO:139, SEQ ID NO:140, 
SEQ ID NO: 1 41 , SEQ ID NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45, 
SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, 
SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ DD NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, 
SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, 
SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. 
SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, 
SEQ ID NO: 1 71 , SEQ ED NO: 1 72, SEQ ID NO: 173, SEQ ED NO: 1 74, SEQ ED NO: 1 75, 
SEQ ED NO: 1 76, SEQ ED NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, 
SEQ ID NO: 1 81 , SEQ ID NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, 
SEQ ID NO: 186, SEQ ID NO: 187, SEQ ED NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, 
SEQ ED NO:191, SEQ ED NO:199, SEQ ID NO:193, SEQ ED NO:194, SEQ ID NO:195, 
SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ED NO: 1 99, SEQ ID NO:200, 
SEQ ED NO:201, SEQ ED NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, 
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, 
SEQ ED NO:21 1, SEQ ED NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID N0:216, SEQ ID NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ID NO:220, 
SEQ ID NO:221, SEQ ED NO:222, SEQ ID NO:223, SEQ ED NO:224, SEQ ID NO:225, 
SEQ ID NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ED NO:230, 
SEQ ID NO:231 , SEQ ED NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, 
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SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, 
SEQ ID NO:241, and SEQ ED NO:242 will have at least 75% identity (preferably 90%, 
moTe preferably at least 95% and most preferably 99-100%) to a domain of a polypeptide 
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, 
SEQ ID NO: 1 24, SEQ ID NO: 125, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, 



SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO:131 
SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136 
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141 
SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146 
SEQ ID NO: 1 49, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 
SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56. 
SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ED NO: 1 61 
SEQ ED NO: 164, SEQ ED NO: 165. SEQ ID NO: 166 
SEQ ED NO: 169, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 
SEQ ED NO: 1 74, SEQ ED NO: 175, SEQ ID NO: 1 76 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ED NO: 1 8 1 
SEQ ED NO: 184, SEQ ID NO: 1 85, SEQ ID NO: 1 86 
SEQ ID NO: 1 89, SEQ ED NO: 1 90, SEQ ID NO: 1 9 1 
SEQ ID NO: 194, SEQ ED NO: 195, SEQ ID NO: 196 
SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201 
SEQ ID NO:204, SEQ ID NO:205, SEQ ED NO:206 
SEQ ED NO:209, SEQ ID NO:210, SEQ ED NO:211 
SEQ ED NO:214, SEQ ID NO:215, SEQ ED NO:216 
SEQ ED NO:219, SEQ ED NO:220, SEQ ID NO:221 
SEQ ED NO:224, SEQ ID NO:225, SEQ ID NO:226 
SEQ ED NO:229, SEQ ID NO:230, SEQ ED NO:231 
SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236 



SEQ ID NO:132, SEQ ID NO:133, 
SEQ ID NO:137, SEQ ED NO:138, 
SEQ ED NO:142, SEQ ID NO:143, 
SEQ ED NO:147, SEQ ID NO:148, 
SEQ ID NO: 1 52, SEQ ED NO: 1 53, 
SEQ ED NO: 157, SEQ ED NO: 158, 
SEQ ID NO: 162, SEQ ID NO: 163, 
SEQ ED NO: 1 67, SEQ ID NO: 1 68, 
SEQ ID NO:172, SEQIDNO:173, 
SEQ ID NO: 1 77, SEQ ID NO: 1 78, 
SEQ ED NO:l 82, SEQ ID NO:l 83, 
SEQ ID NO: 187, SEQ ID NO: 188, 
SEQ ID NO: 199, SEQ ED NO: 193, 
SEQ ED NO: 1 97, SEQ ID NO: 1 98, 
SEQ ID NO:202, SEQ ID NO:203, 
SEQ ID NO:207, SEQ ID NO:208, 
SEQ ED NO:212, SEQ ID NO:213, 
SEQ ID NO:217, SEQ ID NO:218, 
SEQ ED NO:222, SEQ ED NO:223, 
SEQ ID NO:227, SEQ ED NO:228, 
SEQ ID NO:232, SEQ ED NO:233, 
SEQ ED N0.237, SEQ ID NO:238, 
and SEQ ED NO:242, where the 



SEQ ID NO:239, SEQ ED NO:240, SEQ ID NO:241, 
domain is selected from the group consisting of an N-terminal domain, a catalytic domain, 
a C-terminal domain, a coiled-coil structure region, a proline-rich region, a spacer region, 
an insert, and a C-terminal tail; (g) is the complement of the nucleotide sequence of (f); (h) 
encodes a polypeptide having an amino acid sequence selected from the group consisting 
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of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID 
NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID 
NO: 1 30, SEQ ED NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 133, SEQ ID NO: 1 34, SEQ ED 
NO: 135, SEQ ID NO:136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ED NO: 139, SEQ ID 
NO:140, SEQ ED NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID 
NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID 
NO:150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID 
NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID 
NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID 

NO: 1 65. SEQ ED NO:l 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID 
NO: 1 70, SEQ ID NO: 1 7 1 , SEQ DD NO: 1 72, SEQ DD NO: 1 73, SEQ ID NO: 1 74, SEQ ID 

NO: 1 75, SEQ ED NO:l 76, SEQ ID NO: 1 77, SEQ ID NO: 178, SEQ ID NO: 1 79, SEQ ID 
NO: 1 80, SEQ DD NO: 1 8 1 , SEQ ED NO: 1 82, SEQ DD NO: 1 83, SEQ DD NO: 1 84, SEQ ID 
NO: 1 85, SEQ DD NO: 1 86, SEQ DD NO: 1 87, SEQ DD NO: 1 88, SEQ ED NO: 1 89, SEQ DD 
NO: 190, SEQ ED NO:191, SEQ DD NO: 199, SEQ DD NO: 193, SEQ DD NO: 194, SEQ ED 
NO: 195, SEQ DD NO: 196, SEQ DD NO: 197, SEQ DD NO: 198, SEQ DD NO: 199, SEQ ID 
NO:200, SEQ DD NO:201, SEQ DD NO:202, SEQ DD NO:203, SEQ DD NO:204, SEQ DD 
NO:205, SEQ DD NO:206, SEQ DD NO:207, SEQ DD NO:208, SEQ ID NO:209, SEQ ID 
NO:210, SEQ ID NO:21 1, SEQ DD N0.212, SEQ DD NO:213, SEQ ID NO:214, SEQ ID 
NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ DD NO:219, SEQ ID 
NO:220, SEQ DD NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID 
NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID 
NO:230, SEQ ID NO:231 , SEQ DD NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ DD 
NO:235, SEQ DD NO:236, SEQ ID NO:237, SEQ DD NO:238, SEQ ID NO:239, SEQ ID 
NO:240, SEQ DD NO:241, and SEQ DD NO:242, or the corresponding full-length amino 
acid sequence, or fragments thereof. A sequence that is substantially similar to a sequence 
selected from the group consisting of those set forth in SEQ DD NO:122, SEQ DD NO:123, 
SEQ ED NO: 124, SEQ ID NO: 125, SEQ ED NO: 126, SEQ DD NO: 127, SEQ DD NO: 128, 
SEQ DD NO: 129, SEQ DD NO: 130, SEQ DD NO: 131, SEQ ID NO: 132, SEQ DD NO: 133, 
SEQ DD NO: 134, SEQ ED NO: 135, SEQ DD NO: 136, SEQ DD NO: 137, SEQ DD NO: 138, 
SEQ DD NO: 1 39, SEQ DD NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID NO: 1 42, SEQ ID NO : 1 43, 
SEQ DD NO: 144, SEQ ID NO: 145, SEQ DD NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, 
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SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 151, SEQ ID NO: 1 52, SEQ ID NO: 153, 
SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, 
SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, 
SEQ ID NO:164, SEQ ID NO:165. SEQ ID N0:166, SEQ ID NO:167, SEQ ID NO:168, 
5 SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, 

SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, 
SEQ ID NO: 1 79, SEQ ID NO:l 80, SEQ ID NO: 1 8 1 , SEQ ID NO:l 82, SEQ ID NO: 1 83, 
SEQ ID NO: 184, SEQ ID NO:185, SEQ ID NO: 186, SEQ ID NO:187, SEQ ID NO: 188, 
SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, 

10 SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, 
SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, 
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, 
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, 
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, 

15 SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, 

SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, 
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242 will have al 

20 least 75% identity (preferably 90%, more preferably at least 95% and most preferably 99- 

1 00%) to the sequence selected from the group consisting of those set forth in SEQ ID 
NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ED NO: 130, SEQ ID NO:131, SEQ ID 
NO: 132, SEQ ED NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ED 

25 NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 

NO:142, SEQ ED NO:143, SEQ ED NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ID 

NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 
NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID 
30 NO: 1 62, SEQ ED NO: 1 63, SEQ ED NO: 1 64, SEQ ID NO: 1 65. SEQ ED NO: 1 66, SEQ ID 
NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 1 72, SEQ ED NO: 1 73, SEQ ED NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
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NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 81 , SEQ ID 
NO: 182, SEQ ED NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
5 NO:197, SEQ ID NO:198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1 , SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 

1 0 NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ED 
NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241, and SEQ 
ED NO:242; (b) is the complement of the nucleotide sequence of (a); (c) hybridizes under 

15 highly stringent conditions to the nucleotide molecule of (a) and encodes a naturally 

occurring kinase polypeptide; (d) encodes a kinase polypeptide having an amino acid 
sequence selected from the group consisting of those set forth in SEQ ID NO: 122, SEQ ID 
NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID 
NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID 

20 NO:133, SEQ ED NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID 

NO: 138, SEQ ED NO: 139, SEQ ID NO: 140, SEQ ED NO: 141, SEQ ID NO: 142, SEQ ID 
NO: 143, SEQ ED NO: 144, SEQ ED NO: 145, SEQ ED NO: 146, SEQ ED NO: 147, SEQ ED 
NO: 1 48, SEQ ED NO: 1 49, SEQ ED NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ED NO: 1 52, SEQ ID 
NO: 153, SEQ ED NO: 154, SEQ ED NO: 155, SEQ ED NO: 156, SEQ ED N0.157, SEQ ID 

25 NO: 158, SEQ ED NO: 159, SEQ ID NO: 160, SEQ ED NO: 161, SEQ ED NO: 162, SEQ ID 

NO: 1 63, SEQ ED NO: 1 64, SEQ ID NO: 1 65. SEQ ED NO: 1 66, SEQ ED NO:l 67, SEQ ED 
NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ED NO: 1 71 , SEQ ED NO: 1 72, SEQ ID 
NO: 1 73, SEQ ED NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO:l 77, SEQ ID 
NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, SEQ ED NO: 1 81 , SEQ ID NO: 1 82, SEQ ID 

30 NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID NO:l 87, SEQ ID 
NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID 
NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, SEQ ED NO: 196, SEQ ED NO: 197, SEQ ID 
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NO: 198, SEQ ED NO: 199, SEQ ID NO:200, SEQ ED NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
5 NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 

NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO.230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID N0.235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the 
1 0 corresponding full-length amino acid sequence, or fragments thereof. A sequence that is 

substantially similar to a sequence selected from the group consisting of those set forth in 
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SEQ ID NO:227, SEQ ID NO:22S, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , 
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, 
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242 will have at least 75% identity (preferably 90%, more preferably at 
least 95% and most preferably 99-100%) to the sequence of SEQ ID NO: 122, SEQ ID 
NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID 
NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID 
NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ED NO: 140, SEQ ID NO: 141, SEQ ED NO: 142, SEQ ED 
NO: 143, SEQ ED NO: 144, SEQ ED NO: 145, SEQ ED NO: 146, SEQ ED NO: 147, SEQ ID 
NO: 1 48, SEQ ID NO: 1 49, SEQ ED NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ED NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID 
NO: 158, SEQ ID NO: 159, SEQ ED NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID 
NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID 
NO: 1 68, SEQ ED NO: 1 69, SEQ ID NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ID 
NO: 1 73, SEQ ED NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID 
NO: 1 78, SEQ ID NO: 1 79, SEQ ED NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 
NO: 1 83, SEQ ID NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID NO: 1 87, SEQ ID 
NO: 1 88, SEQ ED NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID 
NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, SEQ ED NO: 197, SEQ ED 
NO: 1 98, SEQ ED NO: 1 99, SEQ ED NO:200, SEQ ED NO:201 , SEQ ED NO:202, SEQ ID 
NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ED NO:212, SEQ ID 
NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ED NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ID 
NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ ID NO:242, 
except that it lacks one or more of the domains selected from the group consisting of a N- 
terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil structure region, a 
proline-rich region, a spacer region, an insert, and a C-terminal tail; or (i) is the 
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complement of the nucleotide sequence of (h). The domain demarcations of the 
polypeptides of the invention are indicated in Table 2 by reference to the kinase domain. 

The term "complement" refers to two nucleotides that can form multiple favorable 
interactions with one another. For example, adenine is complementary to thymine as they 

5 can form two hydrogen bonds. Similarly, guanine and cytosine are complementary since 

they can form three hydrogen bonds. A nucleotide sequence is the complement of another 
nucleotide sequence if all of the nucleotides of the first sequence are complementary to all 
of the nucleotides of the second sequence. 

The term "domain" refers to a region of a polypeptide that contains a particular 

10 function. For instance, N-terminal or C-terminal domains of signal transduction proteins 

can serve functions including, but not limited to, binding molecules that localize the signal 
transduction molecule to different regions of the cell or binding other signaling molecules 
directly responsible for propagating a particular cellular signal. Some domains can be 
expressed separately from the rest of the protein and function by themselves, while others 

15 must remain part of the intact protein to retain function. The latter are termed functional 

regions of proteins and also relate to domains. 

The term "N-terminal domain" refers to the extracatalytic region located between 
the initiator methionine and the catalytic domain of the protein kinase. The N-terminal 
domain can be identified following a Smith-Waterman alignment of the protein sequence 

20 against the non-redundant protein database to define the N-terminal boundary of the 

catalytic domain. Depending on its length, the N-terminal domain may or may not play a 
regulatory role in kinase function. An example of a protein kinase whose N-terminal 
domain has been shown to play a regulatory role is PAK65, which contains a CRIB motif 
used for Cdc42 and rac binding (Burbelo, P.D. et al (1995) J. Biol. Chem. 270, 29071- 

25 29074). The N-terminal domain of a protein kinase of the invention is that portion of the 

protein kinase to the amino-terminal side of the kinase domain where the kinase domain is 
identified in Table 2, herein. Further, in some cases, portions of the N-terminal domains 
of the protein kinases of the invention have not been identified since the entire sequence is 
not available. However, with the methods described herein, the full-length sequences of 

30 the kinases of the invention can be determined and using the approaches described herein 

the N-terminal domain can be identified. 
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The term "catalytic domain" or "kinase domain" refers to a region of the protein 
kinase that is typically 25-300 amino acids long and is responsible for carrying out the 
phosphate transfer reaction from a high-energy phosphate donor molecule such as ATP or 
GTP to itself (autophosphorylation) or to other proteins (exogenous phosphorylation). 

5 The catalytic domain of protein kinases is made up of 12 subdomains that contain highly 

conserved amino acid residues, and are responsible for proper polypeptide folding and for 
catalysis. The catalytic domain can be identified following a Smith-Waterman alignment 
of the protein sequence against the non-redundant protein database. The catalytic/kinase 
domains of the protein kinases of the invention are identified in Table 2, herein. Further, 

1 0 in some cases, the complete sequence of the catalytic/kinase domains of the protein 

kinases of the invention may not have been provided since the entire sequence is not 
available. However, with the methods described herein, the full-length sequences of the 
kinases of the invention can be determined, and using the approaches described herein, the 
catalytic/kinase domain can be identified. 

1 5 Th e term "catalytic activity", as used herein, defines the rate at which a kinase 

catalytic domain phosphorylates a substrate. Catalytic activity can be measured, for 
example, by determining the amount of a substrate converted to a phosphorylated product 
as a function of time. Catalytic activity can be measured by methods of the invention by 
holding time constant and determining the concentration of a phosphorylated substrate 

20 after a fixed period of time. Phosphorylation of a substrate occurs at the active-site of a 

protein kinase. The active-site is normally a cavity in which the substrate binds to the 
protein kinase and is phosphorylated. 

The term "substrate" as used herein refers to a molecule phosphorylated by a 
kinase of the invention. Kinases phosphorylate substrates on serine/threonine or tyrosine 

25 amino acids. The molecule may be another protein or a polypeptide. 

The term "C-terminal domain" refers to the region located between the catalytic 
domain and the carboxy-terminal amino acid residue of the protein kinase. The C- 
terminal domain can be identified by using a Smith-Waterman alignment of the protein 
sequence against the non-redundant protein database to define the C-terminal boundary of 

30 the catalytic domain or of any functional C-terminal extracatalytic domain. Depending on 
its length and amino acid composition, the C-terminal domain may or may not play a 
regulatory role in kinase function. An example of a protein kinase whose C-terminal 
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domain may play a regulatory role is PAK3 which contains a heterotrimeric G b subunit- 
binding site near its C-terminus (Leeuw, T. et ah (1998) Nature, 391, 191-195). The C- 
terminal domain of a protein kinase of the invention is that portion of the protein kinase to 
the carboxy-terminal side of the kinase domain where the kinase domain is identified in 
5 Table 2, herein. In some cases, the C-terminal domains of the protein kinases of the 

invention have not been provided since the entire sequence is not available. However, 
with the methods described herein, the full-length sequences of the kinases of the 
invention can be determined, and using the approaches described herein, the C-terminal 
domain can be identified. 

10 The term "signal transduction pathway" refers to the molecules that propagate an 

extracellular signal through the cell membrane to become an intracellular signal. This 
signal can then stimulate a cellular response. The polypeptide molecules involved in 
signal transduction processes are typically receptor and non-receptor protein tyrosine 
kinases, receptor and non-receptor protein phosphatases, SRC homology 2 and 3 domains, 

1 5 phosphotyrosine binding proteins (SRC homology 2 (SH2) and phosphotyrosine binding 

(PTB and PH) domain containing proteins), proline-rich binding proteins (SH3 domain 
containing proteins), nucleotide exchange factors, and transcription factors. 

The term "coiled-coil structure region" as used herein, refers to a polypeptide 
sequence that has a high probability of adopting a coiled-coil structure as predicted by 

20 computer algorithms such as COILS (Lupas, A. (1 996) Meth. Enzymology 266:5 1 3-525). 

Coiled-coils are formed by two or three amphipathic a-helices in parallel. Coiled-coils can 
bind to coiled-coil domains of other polypeptides resulting in homo- or heterodimers 
(Lupas, A. (1991) Science 252:1 162-1 164). Coiled-coil-dependent oligomerization has 
been shown to be necessary for protein function including catalytic activity of 

25 serine/threonine kinases (Roe, J. et al (1997) J. Biol. Chem. 272:5838-5845). Coiled-coil 

regions in the proteins of the invention can be identified using these methods. They may 
be present as sub-domains of the N-terminal, kinase, or C-terminal domains of the 
polypeptides of the invention. 

The term "proline-rich region" as used herein, refers to a region of a protein kinase 

30 whose proline content over a given amino acid length is higher than the average content of 

this amino acid found in proteins {i.e., >10%). Proline-rich regions are easily discernable 
by visual inspection of amino acid sequences and quantitated by standard computer 
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sequence analysis programs such as the DNAStar program EditSeq. Proline-rich regions 
have been demonstrated to participate in regulatory protein -protein interactions. Among 
these interactions, those that are most relevant to this invention involve the 4< PxxP" proline 
rich motif found in certain protein kinases {i.e., human PAK1) and the SH3 domain of the 
adaptor molecule Nek (Galisteo, ML. ei al (1996) J. Biol. Chem. 271 :20997-21000). 
Other regulatory interactions involving "PxxP" proline-rich motifs include the WW 
domain (Sudol, M. (1996) Prog. Biophys. Mol. Bio. 65:1 13-132). Proline rich regions in 
the proteins of the invention can be identified using these methods. They may be present 
..as subrdomains of the N-terminal, kinase, or C-terminal domains of the polypeptides of 
the invention. 

The term "spacer region" as used herein, refers to a region of the protein kinase 
located between predicted functional domains. The spacer region has no detectable 
homology to any amino acid sequence in the database, and can be identified by using a 
Smith-Waterman alignment of the protein sequence against the non-redundant protein 
database to define the C- and N-terminal boundaries of the flanking functional domains. 
Spacer regions may or may not play a fundamental role in protein kinase function. 
Precedence for the regulatory role of spacer regions in kinase function is provided by the 
role of the sre kinase spacer in inter-domain interactions (Xu, W. et al (1997) Nature 
385:595-602). Spacer regions in the proteins of the invention can be identified using these 
methods. They may be present as sub-domains of the N-terminal, kinase, or C-terminal 
domains of the polypeptides of the invention. 

The term "insert" as used herein refers to a portion of a protein kinase that is 
absent from a close homolog. Inserts may or may not by the product alternative splicing 
of exons. Inserts can be identified by using a Smith-Waterman sequence alignment of the 
protein sequence against the non-redundant protein database, or by means of a multiple 
sequence alignment of homologous sequences using the DNAStar program Megalign. 
Inserts may play a functional role by presenting a new interface for protein-protein 
interactions, or by interfering with such interactions. Insert regions in the proteins of the 
invention can be identified using these methods. They may be present as sub-domains of 
the N-terminal, kinase, or C-terminal domains of the polypeptides of the invention. 
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The term "C-terminal tail" as used herein, refers to a C-terminal domain of a 
protein kinase, that by homology extends or protrudes past the C-terminal amino acid of 
its closest homolog. C-terminal tails can be identified by using a Smith-Waterman 
sequence alignment of the protein sequence against the non-redundant protein database, or 
5 by means of a multiple sequence alignment of homologous sequences using the DNAStar 

program Megalign. Depending on its length, a C-terminal tail may or may not play a 
regulatory role in kinase function. C-terminal tail regions in the proteins of the invention 
can be identified using these methods. They may be present as sub-domains of the N- 
terminal, kinase, or C-terminal domains of the polypeptides of the invention. 

1 o Various low or high stringency hybridization conditions may be used depending 

upon the specificity and selectivity desired. These conditions are well-known to those 
skilled in the art. Under stringent hybridization conditions only highly complementary 
nucleic acid sequences hybridize. Preferably, such conditions prevent hybridization of 
nucleic acids having more than 1 or 2 mismatches out of 20 contiguous nucleotides, more 

15 preferably, such conditions prevent hybridization of nucleic acids having more than 1 or 2 

mismatches out of 50 contiguous nucleotides, most preferably, such conditions prevent 
hybridization of nucleic acids having more than 1 or 2 mismatches out of 1 00 contiguous 
nucleotides. In some instances, the conditions may prevent hybridization of nucleic acids 
having more than 5 mismatches in the full-length sequence. 

20 By stringent hybridization assay conditions is meant hybridization assay conditions 

at least as stringent as the following: hybridization in 50% formamide, 5X SSC, 50 mM 
NaH 2 P0 4 , pH 6.8, 0.5% SDS, 0.1 mg/mL sonicated salmon sperm DNA, and 5X Denhart 
solution at 42 °C overnight; washing with 2X SSC, 0.1% SDS at 45 °C; and washing with 
0.2X SSC, 0.1%> SDS at 45 °C. Under some of the most stringent hybridization assay 

25 conditions, the second wash can be done with 0.1X SSC at a temperature up to 70 °C (pg. 

421,Bergere/a/. H987) Guide to Molecular Cloning Techniques , Meth. Enzym. vol. 152, 
hereby incorporated by reference herein including any figures, tables, or drawings.). 
However, other applications may require the use of conditions falling between these sets 
of conditions. Methods of determining the conditions required to achieve desired 

30 hybridizations are well-known to those with ordinary skill in the art, and are based on 

several factors, including but not limited to, the sequences to be hybridized and the 
samples to be tested. 
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In other preferred embodiments, the invention features isolated, enriched, or 
purified nucleic acid molecules encoding kinase polypeptides, further comprising a vector 
or promoter effective to initiate transcription in a host cell. The invention also features 
recombinant nucleic acid, preferably in a cell or an organism. The recombinant nucleic 
5 acid may contain a sequence selected from the group consisting of those set forth in SEQ 

ID NO:l , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 1 0, SEQ ID NO:l 1 , SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ED NO: 14, SEQ ID NO: 15, SEQ ED NO: 16, SEQ ED NO:17, 
SEQ ED NO: 18, SEQ ED NO: 19, SEQ ED NO:20, SEQ ED NO:21, SEQ ED NO:22, SEQ 

1 0 ID NO:23, SEQ ED NO:24, SEQ ED NO:25, SEQ ED NO:26, SEQ ED NO:27, SEQ ED 

NO:28, SEQ ID NO:29, SEQ ED NO:30, SEQ ED NO:3 1 , SEQ ED NO:32, SEQ ED NO:33, 
SEQ ED NO:34, SEQ ID NO:35, SEQ ED NO:36, SEQ ED NO:37, SEQ ED NO:38, SEQ 
ID NO:39, SEQ ED NO:40, SEQ ED NO:41 , SEQ ID NO:42, SEQ ID NO:43, SEQ ID 
NO:44, SEQ ED NO:45, SEQ ED NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, 

15 SEQ ED NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ 

ID NO:55, SEQ ED NO:56, SEQ ED NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ED NO:61, SEQ ED NO:62, SEQ ED NO:63, SEQ ID NO:64, SEQ ED NO:65, 
SEQ ED NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ 
ED NO:71 , SEQ ED NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID 

20 NO:76, SEQ ED NO:77, SEQ ED NO:78, SEQ ED NO:79, SEQ ED NO:80, SEQ ID NO:81 , 
SEQ ED NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ED NO:86, SEQ 
ID NO:87, SEQ ED NO:88, SEQ ED NO:89, SEQ ED NO:90, SEQ ED NO:91, SEQ ED 
NO:92, SEQ ED NO:93, SEQ ED NO:94, SEQ ED NO:95, SEQ ED NO:96, SEQ ED NO:97, 
SEQ ED NO:98, SEQ ED NO:99, SEQ ED NO: 100, SEQ ED NO: 101, SEQ ED NO:102, 

25 SEQ ED NO: 103, SEQ ED NO: 104, SEQ ED NO: 105, SEQ ED NO: 106, SEQ ED NO:107, 
SEQ ED NO: 1 08, SEQ ED NO: 1 09, SEQ ED NO: 1 1 0, SEQ ED NO: 1 1 1 , SEQ ID NO: 1 1 2, 
SEQ ED NO: 1 1 3, SEQ ID NO: 1 1 4, SEQ ID NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, 
SEQ ED NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ED NO: 120, and SEQ ID NO: 1 2 1 , or a functional 
derivative thereof and a vector or a promoter effective to initiate transcription in a host 

30 cell. The recombinant nucleic acid can alternatively contain a transcriptional initiation 
region functional in a cell, a sequence complementary to an RNA sequence encoding a 
kinase polypeptide and a transcriptional termination region functional in a cell. Specific 
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vectors and host cell combinations are discussed herein. The recombinant nucleic acid can 
also contain the full-length sequence encoding the protein kinase, or a domain, for 
example. 

The term "vector" relates to a single or double-stranded circular nucleic acid 
5 molecule that can be transfected into cells and replicated within or independently of a cell 

genome. A circular double-stranded nucleic acid molecule can be cut and thereby 
linearized upon treatment with restriction enzymes. An assortment of nucleic acid vectors, 
restriction enzymes, and the knowledge of the nucleotide sequences cut by restriction 
enzymes are readily available to those skilled in the art. A nucleic acid molecule encoding 

10 a kinase can be inserted into a vector by cutting the vector with restriction enzymes and 

ligating the two pieces together. 

The term "transfecting" defines a number of methods to insert a nucleic acid vector 
or other nucleic acid molecules into a cellular organism. These methods involve a variety 
of techniques, such as treating the cells with high concentrations of salt, an electric field, 

1 5 detergent, or DMSO to render the outer membrane or wall of the cells permeable to 

nucleic acid molecules of interest or use of various viral transduction strategies. 

The term "promoter" as used herein, refers to nucleic acid sequence needed for 
gene sequence expression. Promoter regions vary from organism to organism, but are well 
known to persons skilled in the art for different organisms. For example, in prokaryotes, 

20 the promoter region contains both the promoter (which directs the initiation of RNA 

transcription) as well as the DNA sequences which, when transcribed into RNA, will 
signal synthesis initiation. Such regions will normally include those 5'-non-coding 
sequences involved with initiation of transcription and translation, such as the TATA box, 
capping sequence, CAAT sequence, and the like. 

25 In preferred embodiments, the isolated nucleic acid comprises, consists essentially 

of, or consists of a nucleic acid sequence set forth in SEQ ID NO:l , SEQ ID NO:2, SEQ 
ID NO:3, SEQ ID NO:4 5 SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID NO:13, SEQ ID 
NO: 1 4, SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, SEQ ID NO: 1 9, 

30 SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ED NO:24, SEQ 
ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, 
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SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ED NO:39, SEQ ID NO:40, SEQ 
ID N0:41, SEQ DO NO:42, SEQ ID NO:43, SEQ ED NO:44, SEQ ID NO:45, SEQ ID 
NO:46, SEQ ED NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID N0:51 , 
SEQ ED NO:52, SEQ ED NO:53, SEQ ED NO:54, SEQ ED NO:55, SEQ ED NO:56, SEQ 
ID NO:57, SEQ ID NO:58, SEQ ED NO:59, SEQ ED NO:60 ; SEQ ED N0:61, SEQ ED 
NO:62, SEQ ID NO:63, SEQ ED NO:64, SEQ ED NO:65, SEQ ED NO:66, SEQ ID NO:67, 
SEQ ID NO:68, SEQ ED NO:69, SEQ ID NO:70, SEQ ED N0:71, SEQ ID NO:72, SEQ 
ED NO:73, SEQ ED NO:74, SEQ ED NO:75, SEQ ED NO:76, SEQ ED NO:77, SEQ ED 
NO:78, SEQ ED NO:79, SEQ ID NO:80, SEQ ID N0:8 1 , SEQ ED NO:82, SEQ ED NO:83, 
SEQ ED NO:84, SEQ ED NO:85, SEQ ID NO:86, SEQ ED N<D:87, SEQ ID NO:88, SEQ 
ID NO:89, SEQ ID NO:90, SEQ ED N0:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID 
NO:94, SEQ ED NO:95, SEQ ED NO:96, SEQ ID NO:97, SEQ ED NO:98, SEQ ED NO:99, 
SEQ ED NO: 100, SEQ ID NO: J 01, SEQ ID NO: 102, SEQ ED NO: 103, SEQ ID NO: 104, 
SEQ ED NO: 105, SEQ DD NO:106, SEQ ID NO: 107, SEQ ED NO: 108, SEQ ED NO: 109, 
SEQ ED NO: 11 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3 , SEQ ID NO: 1 1 4, 
SEQ ED NO:l 15, SEQ DD NO:l 16, SEQ ED NO:l 17, SEQ ED NO:l 18, SEQ ID NO: 1 19, 
SEQ DD NO:120, and SEQ DD NO:121, or the corresponding full-length sequence, encodes 
an amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ED NO: 1 25, SEQ DD NO: 1 26, SEQ ED 
NO: 127, SEQ ED NO: 128, SEQ ED NO: 129, SEQ ED NO: 130, SEQ ED NO: 131, SEQ DD 
NO:132, SEQ ID NO:133, SEQ DD NO:134, SEQ DD NO:135, SEQ DD NO:136, SEQ ID 
NO: 137, SEQ DD NO: 13 8, SEQ ED NO: 139, SEQ ED NO: 140, SEQ DD NO: 141, SEQ ID 
NO:142, SEQ ED NO:143, SEQ ID NO:144, SEQ DD NO:145, SEQ DD NO:146, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ DD NO: 1 49, SEQ DD NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ DD 
NO: 157, SEQ ED NO: 158, SEQ ED NO: 1 59, SEQ ED NO: 1 60, SEQ ED NO: 1 6 1 , SEQ ID 
NO: 1 62, SEQ ED NO: 1 63, SEQ ED NO: 1 64, SEQ ED NO: 1 65 . SEQ ED NO: 1 66, SEQ ID 
NO: 1 67, SEQ DD NO: 1 68, SEQ DD NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 1 72, SEQ ED NO: 1 73, SEQ DD NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ DD NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID 
NO: 1 82, SEQ ED NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ID NO: 1 86, SEQ ED 
NO: 1 87, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ DD NO: 1 90, SEQ DD NO: 1 9 1 , SEQ DD 
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NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ID NO: 196, SEQ ID 
NO:197, SEQ ID NO: 198, SEQ ED NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
5 NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 

NO:217, SEQ CD NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ID 
NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ID 
NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 

10 NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ED NO:241, and SEQ 
ED NO:242, or the corresponding full-length amino acid sequence, a functional derivative 
thereof, or at least 10, 20, 40, 50, 75, 100, 200, 300 or 500 contiguous amino acids of a 
sequence selected from the group consisting of those set forth in SEQ ED NO:122, SEQ ID 
NO:123, SEQ ED NO:124, SEQ ED NO:125, SEQ ED NO:126, SEQ ID NO:127, SEQ ID 

15 NO: 1 28, SEQ ED NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 131, SEQ ID NO: 1 32, SEQ ID 

NO:133, SEQ ID N0.134, SEQ ID NO:135, SEQ ED NO:136, SEQ ID NO:137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ EDNO:144, SEQ ED NO:145, SEQ ED NO:146, SEQ ID NO:147, SEQ ID 
NO:148, SEQ ED NO:149, SEQ ED NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID 

20 NO:153, SEQ ID NO:154, SEQ ED NO:155, SEQ ID N0.156, SEQ ID NO:157, SEQ ID 

NO: 158, SEQ ED NO: 1 59, SEQ ED NO: 1 60, SEQ ED NO: 1 61 , SEQ ED NO: 1 62, SEQ ID 
NO: 163, SEQ ED NO: 164, SEQ ED NO: 165. SEQ ED NO: 166, SEQ ID NO: 167, SEQ ID 
NO: 168, SEQ ED NO: 169, SEQ ED NO: 170, SEQ ED NO: 171, SEQ ED NO: 172, SEQ ID 
NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ED NO: 176, SEQ ED NO: 177, SEQ ID 

25 NO: 1 78, SEQ ID NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 81 , SEQ ED NO: 1 82, SEQ ED 

NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ID NO: 1 86, SEQ ED NO: 1 87, SEQ ID 
NO: 188, SEQ ED NO: 189, SEQ ED NO: 190, SEQ ED NO: 191, SEQ ID NO: 199, SEQ ID 
NO: 1 93, SEQ ED NO: 1 94, SEQ ED NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID 
NO:198, SEQ ED NO:199, SEQ ED NO:200, SEQ ED NO:201, SEQ ID NO:202, SEQ ID 

30 NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ID 
NO:208, SEQ ED NO:209, SEQ ED NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID NO:2 1 2, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
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NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ED NO:221, SEQ ID NO:222, SEQ ID 
NO:223 5 SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ED 
NO:228, SEQ ID NO:229, SEQ ID NO:230 5 SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ED NO:236, SEQ ID NO:237, SEQ ID 
5 NO:238 3 SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or the 

corresponding full-length sequences or derivatives thereof. The nucleic acid may be 
isolated from a natural source by cDNA cloning or by subtractive hybridization. The 
natural source may be mammalian, preferably human, blood, semen, or tissue, and the 
nucleic acid may be synthesized by the triester method or by using an automated DNA 

10 synthesizer. 

The term "mammal" refers preferably to such organisms as mice, rats, rabbits, 
guinea pigs, sheep, and goats, more preferably to cats, dogs, monkeys, and apes, and most 
preferably to humans. 

In yet other preferred embodiments, the nucleic acid is a conserved or unique 

1 5 region, for example those useful for: the design of hybridization probes to facilitate 

identification and cloning of additional polypeptides, the design of PCR probes to 
facilitate cloning of additional polypeptides, obtaining antibodies to polypeptide regions, 
and designing antisense oligonucleotides. 

By "conserved nucleic acid regions", are meant regions present on two or more 

20 nucleic acids encoding a kinase polypeptide, to which a particular nucleic acid sequence 

can hybridize under lower stringency conditions. Examples of lower stringency 
conditions suitable for screening for nucleic acid encoding kinase polypeptides are 
provided in Berger et al (1987) Guide to Molecular Cloning Techniques , Meth. Enzym. 
vol. 152, hereby incorporated by reference herein in its entirety, including any drawings, 

25 figures, or tables. Preferably, conserved regions differ by no more than 5 out of 20 

nucleotides, even more preferably 2 out of 20 nucleotides or most preferably 1 out of 20 
nucleotides. 

By "unique nucleic acid region" is meant a sequence present in a nucleic acid 
coding for a kinase polypeptide that is not present in a sequence coding for any other 
30 naturally occurring polypeptide. Such regions preferably encode 10 (preferably 25, more 
preferably 50, most preferably 75) or more contiguous amino acids selected from the 
group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, 
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SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, 
SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, 
SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, 
SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 

5 SEQ ID NO: 145, SEQ ED NO: 146, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, 

SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, 
SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO: 159, 
SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ED NO: 1 62, SEQ ED NO: 1 63, SEQ ED NO: 1 64, 
SEQ ED NO: 1 65. SEQ ED NO: 1 66, SEQ ED NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, 

10 SEQ ID NO: 1 70, SEQ ED NO: 1 71 , SEQ ED NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, 

SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, 
SEQ ID NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, 
SEQ ED NO:l 85, SEQ ID NO:l 86, SEQ ID NO:l 87, SEQ ID NO:l 88, SEQ ID NO:l 89, 
SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ED NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, 

15 SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, 

SEQ ID NO:200, SEQ ID NO.201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:210, SEQ ED NO:21 1, SEQ ED NO:212, SEQ ID NO:213, SEQ ID NO:214, 
SEQ ID NO:2 1 5, SEQ ED NO:2 1 6, SEQ ED NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, 

20 SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ED NO:222, SEQ ID NO:223, SEQ ID NO:224, 

SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ED NO:229, 
SEQ ID NO:230, SEQ ED NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ ED NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ED NO:240, SEQ ED NO:241, and SEQ ED NO:242, or functional derivatives thereof. 

25 In particular, a unique nucleic acid region is preferably of mammalian origin and 

preferably human. 

A second aspect of the invention features a nucleic acid probe for the detection of 
nucleic acid encoding a kinase polypeptide in a sample, wherein said polypeptide is 
selected from the group consisting of SEQ ED NO:122, SEQ ED NO:123, SEQ ID NO:124, 
30 SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, 
SEQ ID NO: 1 30, SEQ ED NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO:l 33, SEQ ID NO: 1 34, 
SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, 
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SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, 
SEQ ID NO:145, SEQ ED NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ED NO:149, 
SEQ ED NO: 150, SEQ ED NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO:154, 
SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, 
SEQ ID NO:160, SEQ ID NO:161, SEQ ED NO:162, SEQ ID NO:163, SEQ ID NO:164, 
SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ED NO: 168, SEQ ID NO: 169, 
SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, 
SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, 
SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ED NO:l 83, SEQ ID NO: 1 84, 
SEQ ID NO: 185, SEQ ID NO: 186, SEQ ED NO: 187, SEQ ED NO:188, SEQ ID NO:189, 
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ED NO:193, SEQ ID N0:194, 
SEQ ID NO:195, SEQ ID NO: 196, SEQ ID NO:197, SEQ ED NO:198, SEQ ID NO:199, 
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ED NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, 
SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, 
SEQ ID NO:220, SEQ ED NO:221, SEQ ED NO:222, SEQ ID NO:223, SEQ ID NO:224, 
SEQ ID NO:225, SEQ ID NO:226 ; SEQ ID NO:227, SEQ ID N0.228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ED NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID N0.238, SEQ ID N0.239, 
SEQ ID NO:240, SEQ ID NO:241, and SEQ ED N0:242. Preferably, the nucleic acid 
probe encodes a kinase polypeptide that is a fragment of the protein encoded by an amino 
acid sequence selected from the group consisting of those set forth in SEQ ED NO: 122, 
SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, 
SEQ ID NO:128, SEQ ED NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, 
SEQ ED NO:333, SEQ ED NO:134, SEQ ID NO:135, SEQ ED NO:136, SEQ ID NO:137, 
SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ED NO:141, SEQ ED NO:142, 
SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ED NO:146, SEQ ID NO:147, 
SEQ ID NO: 148, SEQ ED NO: 149, SEQ ED NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, 
SEQ ED NO: 153, SEQ ID NO: 154, SEQ ID NO:155, SEQ ID NO: 156, SEQ ID NO: 157, 
SEQ ID NO:158, SEQ ED NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, 
SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, 
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SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO. 172, 
SEQ ID NO: 173, SEQ IDNO:174, SEQ ID NO: 175, SEQ ID NO:176, SEQ ID NO: 177, 
SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, 
SEQ ID NO: 183, SEQ 1DN0:184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, 
5 SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, 
SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, 
SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
.SEQ ID_NO:208,SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 

10 SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, 

SEQ ID NO.218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID N0.224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 

15 SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID 

NO:242, or the corresponding full-length amino acid sequences. The nucleic acid probe 
contains a nucleotide base sequence that will hybridize to a sequence selected from the 
group consisting of those set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ED NO:8, SEQ ID NO:9, 

20 SEQ ID NO: 1 0, SEQ ED NO: 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 1 3, SEQ ID NO: 1 4, SEQ 

ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID 
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, 
SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ 
ID NO:3 1 , SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 

25 NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:4 1 , 
SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ 
ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:5 1 , SEQ ID 
NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, 
SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ 

30 ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID 

NO:68, SEQ ID NO:69, SEQ ED NO:70, SEQ ED NO:71, SEQ ED NO:72, SEQ ED NO:73, 
SEQ ID NO:74, SEQ ED NO:75, SEQ ED NO:76, SEQ ID NO:77, SEQ ED NO:78, SEQ 
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ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID 
NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, 
SEQ ID NO:90, SEQ ID N0:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ 
ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID 
NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID 
NO: 1 05, SEQ ID NO: 1 06, SEQ ID NO: 1 07, SEQ ID NO: 1 08, SEQ ID NO: 1 09, SEQ ID 
NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ED NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 1 1 4, SEQ ID 
NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ED NO: 1 1 9, SEQ ID 
NO: 120, and SEQ ED NO:121, or the corresponding full-length sequence, or a functional 
derivative thereof. 

In preferred embodiments, the nucleic acid probe hybridizes to nucleic acid 
encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous amino 
acids of a sequence selected from the group consisting of those set forth in SEQ ID 
NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
NO: 1 32, SEQ ED NO: 1 33, SEQ ID NO: 1 34, SEQ ED NO: 1 35, SEQ ID NO: 1 36, SEQ ID 
NO: 1 37, SEQ ED NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID 
NO: 142, SEQ ED NO: 143, SEQ ED NO: 144, SEQ ED NO: 145, SEQ ID NO: 146, SEQ ID 
NO: 1 47, SEQ ED NO: 1 48, SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ID 
NO:l 52, SEQ ED NO: 1 53, SEQ ED NO: 1 54, SEQ ED NO: 1 55, SEQ ED NO: 1 56, SEQ ID 
NO: 1 57, SEQ ED NO: 1 58, SEQ ED NO: 1 59, SEQ ED NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 
NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ED NO: 1 65 . SEQ ID NO: 1 66, SEQ ED 
NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, SEQ ED NO: 1 7 1 , SEQ ED 
NO: 1 72, SEQ ED NO: 1 73, SEQ ID NO: 1 74, SEQ ED NO: 1 75, SEQ ED NO: 1 76, SEQ ED 
NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 1 82, SEQ ED NO: 1 83, SEQ ED NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 
NO:187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO: 199, SEQ ED NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID 
NO: 197, SEQ ED NO: 198, SEQ ED NO. 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ID 



WO 00/73469 



PCT/US00/14842 



40 

NO:21 7, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226 5 SEQ ID 
NO:227, SEQ ED NO:228, SEQ ID NO:229, SEQ ID NO.230, SEQ ID NO:23 1 , SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
5 NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242 5 or the corresponding full-length amino acid sequence, or functional 
derivatives thereof. 

Methods for using the probes include detecting the presence or amount of kinase 
RNA in a sample by contacting the sample with a nucleic acid probe under conditions 

10 such that hybridization occurs and detecting the presence or amount of the probe bound to 

kinase RNA. The nucleic acid duplex formed between the probe and a nucleic acid 
sequence coding for a kinase polypeptide may be used in the identification of the sequence 
of the nucleic acid detected (Nelson et ai, in Nonisotopic DNA Probe Techniques, 
Academic Press, San Diego, Kricka, ed. 5 p. 275, 1992, hereby incorporated by reference 

1 5 herein in its entirety, including any drawings, figures, or tables). Kits for performing such 

methods may be constructed to include a container means having disposed therein a 



nucleic acid probe. 

In a third aspect, the invention describes a recombinant cell or tissue comprising a 
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting of 
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SEQ ID NO:l 87, SEQ ID NO:l 88, SEQ ID NO:l 89, SEQ ID NO: 1 90, SEQ ID NO: 191 , 
SEQ ID NO: 199, SEQ ID NO:193, SEQ ID NO: 194, SEQ ID NO:195, SEQ ID NO: 196, 
SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ED NO:201, 
SEQ ID NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, 
SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ID NO:210, SEQ ED NO:21 1, 
SEQ ED NO:2 1 2, SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID NO:2 1 5, SEQ ED NO:2 1 6, 
SEQ EDNO:217, SEQ EDNO:218, SEQ EDNO:219, SEQ EDNO:220, SEQ IDN0.221, 
SEQ ED N0.222, SEQ ED NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ED NO:226, 
SEQ ED NO:227, SEQ ED NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ED NO:231 , 
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, 
SEQ ED NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242. In such cells, the nucleic acid may be under the control of the 
genomic regulatory elements, or may be under the control of exogenous regulatory 
elements including an exogenous promoter. By "exogenous" it is meant a promoter that is 
not normally coupled in vivo transcriptionally to the coding sequence for the kinase 
polypeptides. 

The polypeptide is preferably a fragment of the protein encoded by an amino acid 
sequence selected from the group consisting of those set forth in SEQ ED NO: 122, SEQ ID 
NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID 
NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ED NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID 
NO: 1 33, SEQ ED NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO: 143, SEQ ED NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID 
NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ED NO: 1 57, SEQ ID 
NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID 
NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID 
NO: 168, SEQ ED NO: 169, SEQ ID NO: 170, SEQ ED NO: 171, SEQ ID NO: 172, SEQ ID 
NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 
NO: 1 78, SEQ ED NO: 1 79, SEQ ED NO: 1 80, SEQ ED NO: 1 81 , SEQ ED NO: 1 82, SEQ ID 
NO: 183, SEQ ED NO: 184, SEQ ED NO: 185, SEQ ED NO: 186, SEQ ED NO: 187, SEQ ID 
NO: 1 88, SEQ ED NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID 
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NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID N0.197, SEQ ID 
NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
5 NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:2I8, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 

10 NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or the 

corresponding full-length amino acid sequence. By "fragment," is meant an amino acid 
sequence present in a kinase polypeptide. Preferably, such a sequence comprises at least 
10, 20, 40, 50, 75, 100, 200, or 300 contiguous amino acids a sequence selected from the 
group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, 

15 SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, 

SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID N0.134, 
SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, 
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, 
SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, 

20 SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, 
SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO. 1 58, SEQ ID NO: 1 59, 
SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, 
SEQ ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, 
SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, 

25 SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, 

SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, 
SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, 
SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, 
SEQ ID NO: 195, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, 

30 SEQ ID NO:200, SEQ ID NO.201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 

SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
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SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:2l9, 
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, 
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or of the corresponding full- 
length amino acid sequence, or a functional derivative thereof. 

In a fourth aspect, the invention features an isolated, enriched, or purified kinase 
polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ 
ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ 
ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ 
ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 138, SEQ 
ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ 
ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ 
ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ 

ID NO: 1 54, SEQ ED NO: 1 55, SEQ ED NO: 1 56, SEQ ED NO: 1 57, SEQ ID NO: 1 58, SEQ 
ID NO: 1 59, SEQ ED NO: 1 60, SEQ ID NO: 1 61 , SEQ ED NO: 1 62, SEQ ID NO: 1 63, SEQ 
ID NO:164, SEQ ID NO:165. SEQ ED NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ 
ED NO: 1 69, SEQ ED NO: 1 70, SEQ ID NO: 1 71 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ 
ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ED NO: 177, SEQ ID NO: 178, SEQ 
ID NO:l 79, SEQ ED NO: 1 80, SEQ ID NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ 
ID NO: 184, SEQ ED NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ 
ID NO:189, SEQ ID NO:190, SEQ ED NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ 
ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ED NO: 197, SEQ ID NO: 198, SEQ 
ID NO: 199, SEQ IDNO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ 
ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ 
ED NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ED NO:213, SEQ 
ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ED NO:218, SEQ 
ID NO:219, SEQ ED NO:220, SEQ ID NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ 
ID NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ED NO:228, SEQ 
ID NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ 
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ID NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ 
ID NO:239, SEQ ED NO:240, SEQ ID NO:241 , and SEQ ID NO:242. 

By "isolated" in reference to a polypeptide is meant a polymer of amino acids (2 or 
more amino acids) conjugated to each other, including polypeptides that are isolated from 
5 a natural source or that are synthesized. The isolated polypeptides of the present invention 

are unique in the sense that they are not found in a pure or separated state in nature. Use 
of the term "isolated" indicates that a naturally occurring sequence has been removed from 
its normal cellular environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 

10 the only amino acid chain present, but that it is essentially free (about 90 - 95% pure at 

least) of non-amino acid material naturally associated with it. 

By the use of the term "enriched" in reference to a polypeptide is meant that the 
specific amino acid sequence constitutes a significantly higher fraction (2 - 5 fold) of the 
total amino acid sequences present in the cells or solution of interest than in normal or 

15 diseased cells or in the cells from which the sequence was taken. This could be caused by 

a person by preferential reduction in the amount of other amino acid sequences present, or 
by a preferential increase in the amount of the specific amino acid sequence of interest, or 
by a combination of the two. However, it should be noted that enriched does not imply 
that there are no other amino acid sequences present, just that the relative amount of the 

20 sequence of interest has been significantly increased. The term significant here is used to 

indicate that the level of increase is useful to the person making such an increase, and 
generally means an increase relative to other amino acid sequences of about at least 2-fold, 
more preferably at least 5- to 10-fold or even more. The term also does not imply that 
there is no amino acid sequence from other sources. The other source of amino acid 

25 sequences may, for example, comprise amino acid sequence encoded by a yeast or 

bacterial genome, or a cloning vector such as pUC19. The term is meant to cover only 
those situations in which man has intervened to increase the proportion of the desired 
amino acid sequence. 

It is also advantageous for some purposes that an amino acid sequence be in 

30 purified form. The term "purified" in reference to a polypeptide does not require absolute 
purity (such as a homogeneous preparation); instead, it represents an indication that the 
sequence is relatively purer than in the natural environment. Compared to the natural level 
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this level should be at least 2-5 fold greater (e.g., in terms of mg/mL). Purification of at 
least one order of magnitude, preferably two or three orders, and more preferably four or 
five orders of magnitude is expressly contemplated. The substance is preferably free of 
contamination at a functionally significant level, for example 90%, 95%, or 99% pure. 

5 In preferred embodiments, the kinase polypeptide is a fragment of the protein 

encoded by an amino acid sequence selected from the group consisting of those set forth in 
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, 
SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, 

0 SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, 

SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, 
SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , 
SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, 
SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , 

5 SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 3 65. SEQ ID NO: 166, 
SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , 
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, 
SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , 
SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, 

10 SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , 

SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, 
SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, 
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, 
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1 , 

15 SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, 

SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ED NO:220, SEQ ID NO:221, 
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, 
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, 
SEQ ED NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ED NO:235, SEQ ID NO:236, 

50 SEQ ED NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , 
and SEQ ID NO:242, or the corresponding full-length amino acid sequences. Preferably, 
the kinase polypeptide contains at least 10, 20, 40, 50, 75, 100, 200, or 300 contiguous 
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amino acids a sequence selected from the group consisting of those set forth in SEQ ID 
NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID 
5 NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID 
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ IDNO:150, SEQ ID NO:151, SEQ ID 
NO: 1 52, SEQ ID NO: 1 53 , SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 
..NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO:161, SEQ ID 

10 NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO : 1 7 1 , SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ED NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ED NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 
NO: 182, SEQ ED NO: 183, SEQ ED NO: 184, SEQ ED NO: 185, SEQ ED NO: 186, SEQ ID 

15 NO: 1 87, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ED NO: 1 9 1 , SEQ ED 
NO: 199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, SEQ ED 
NO: 1 97, SEQ ED NO: 1 98, SEQ ED NO: 1 99, SEQ ED NO:200, SEQ ED NO:201 , SEQ ID 
NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID 
NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID 

20 NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 

NO:217, SEQ ID NO:218, SEQ ED N0:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ED NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ID 

25 NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ 

ID NO:242, or the corresponding full-length amino acid sequence, or a functional 
derivative thereof. 

In preferred embodiments, the kinase polypeptide comprises an amino acid 
sequence having (a) an amino acid sequence selected from the group consisting of those 
30 set forth in SEQ ED NO:122, SEQ ID N0.123, SEQ ID NO:124, SEQ ED NO:125, SEQ 
ED NO: 126, SEQ ED NO: 127, SEQ ED NO: 128, SEQ ED NO: 129, SEQ ID NO: 130, SEQ 
ED NO:131, SEQ IDNO:132, SEQ ED NO:133, SEQ ED NO:134, SEQ ID NO:135, SEQ 
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ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ 
ED NO: 141, SEQ ED NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ ED NO: 145, SEQ 
ID NO: 146, SEQ ED NO: 147, SEQ ED NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ 
ID NO: 151, SEQ ED NO: 152, SEQ ID NO: 153, SEQ ED NO: 154, SEQ ID NO: 155, SEQ 
5 ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ED NO: 159, SEQ ED NO: 160, SEQ 

ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ED NO: 1 64, SEQ ED NO: 1 65 . SEQ 
ID NO: 166, SEQ ID NO: 167, SEQ ED NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ 
ID NO: 171 , SEQ ID NO: 1 72, SEQ ID NO:l 73, SEQ ED NO:l 74, SEQ ID NO:l 75, SEQ 
ED NO: 1 76, SEQ ED NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ID NO: 1 80, SEQ 

10 ID NO: 181, SEQ ID NO: 182, SEQ ED NO: 183, SEQ ED NO: 184, SEQ ID NO: 185, SEQ 
ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ 
ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, SEQ 
ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ED NO: 1 99, SEQ ID NO:200, SEQ 
ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ 

15 ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ID NO:2 1 0, SEQ 

ID NO:21 1 , SEQ ID NO:2 1 2, SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID NO:2 1 5, SEQ 
ID NO:2 1 6, SEQ ED NO:2 1 7, SEQ ED NO:2 1 8, SEQ ED NO:2 1 9, SEQ ID NO:220, SEQ 
ID NO:221, SEQ ID NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 

20 ID NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
ID NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241, and SEQ DD NO:242; (b) an amino acid sequence selected from the group 
consisting of those set forth in SEQ ED NO:122, SEQ ED NO:123, SEQ ID NO:124, SEQ 
ID NO: 1 25, SEQ ED NO: 1 26, SEQ ED NO: 1 27, SEQ ID NO: 1 2 8, SEQ ID NO: 1 29, SEQ 

25 ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ED NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ 
ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 13 8, SEQ ID NO: 139, SEQ 
IDNO:140, SEQ ID NO: 141, SEQ ID N0.142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ 
ID NO: 145, SEQ ID NO: 146, SEQ ED NO: 147, SEQ ID NO: 148, SEQ ED NO: 149, SEQ 
ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ 

30 ID NO: 1 55, SEQ ID NO: 1 56, SEQ ED NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ 
ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ 
ID NO: 1 65 . SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ED NO: 1 69, SEQ 
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ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ 
ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ED NO: 179, SEQ 
ID NO: 1 80, SEQ ID NO: 1 81 , SEQ ID NO:l 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ 
ID NO: 185, SEQ ID NO: 186, SEQ ED NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ 
ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ 
ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ 
ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ 
ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ 
ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ 
ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ 
ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ 
ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ 
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ 
ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, except that it lacks one or more, but 
not all, of a domain selected from the group consisting of an N-terminal domain, a 



catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region, 
a spacer region, an insert, and a C-terminal tail; (c) an amino acid sequence of a domain of 
a polypeptide selected from the group consisting of those set forth in SEQ ID NO: 122, 
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SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, 
SEQ ID NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, 
SEQ ID NO: 198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
SEQ ID NO:208, SEQ ED NO:209, SEQ BDNO:210, SEQ ID NO:21 1, SEQ ID NO:212, 
SEQ ID NO:213, SEQ ID NO:214, SEQ 1DN0:215, SEQ ID NO:216, SEQ ID NO:217, 
SEQ ID N0:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221 , SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID 
NO:242 where the domain is selected from the group consisting of an N-terminal domain, 
a catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich 
region, a spacer region, an insert, and a C-terminal tail; or (d) an amino acid sequence 
selected from the group consisting of those set forth in SEQ ID NO: 1 22, SEQ ID NO: 1 23, 
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, 
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, 
SEQ ID NO: 1 34, SEQ ID NO: 135, SEQ ID NO: 1 36, SEQ ID NO:l 37, SEQ ID NO: 1 3 8, 
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, 
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, 
SEQ ID NO:149, SEQ ID NO: 150, SEQ IDNO:151, SEQ ID NO: 152, SEQ ID NO: 153, 
SEQ ID NO:154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, 
SEQ ID NO: 159, SEQ ED NO: 160, SEQ ID NO: 161, SEQ ED NO: 162, SEQ ID NO: 163, 
SEQ ED NO: 164, SEQ ED NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, 
SEQ ED NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ED NO: 172, SEQ ID NO: 173, 
SEQ ID NO: 1 74, SEQ ED NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, 
SEQ ED NO:l 79, SEQ ED NO:180, SEQ ID NO: 1 81, SEQ ID NO:l 82, SEQ ID NO: 1 83, 
SEQ ED NO: 1 84, SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ED NO: 1 87, SEQ ED NO: 188, 
SEQ ED NO:189, SEQ ED NO: 190, SEQ ED NO: 191, SEQ ED NO: 199, SEQ ID NO: 193, 
SEQ ID NO: 1 94, SEQ ED NO: 1 95, SEQ ED NO: 1 96, SEQ ED NO: 1 97, SEQ ED NO: 1 98, 
SEQ ED NO: 1 99, SEQ ED NO:200, SEQ ED NO:201 , SEQ ED NO:202, SEQ ED NO:203, 
SEQ ID NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ED NO.207, SEQ DD NO:208, 
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SEQ ID NO:209, SEQ ID NO:210, SEQ ID N0:21 1, SEQ ID NO:212, SEQ ID NO:213, 
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, 
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO.222, SEQ ID NO:223, 
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, 

5 SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, 
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, except that it 
lacks one or more, but not all, of the domains selected from the group consisting of a C- 
terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a proline-rich 

1 0 region, a coiled-coil structure region, an insert, and a C-terminal tail. (The domain 

demarcations of the polypeptides of the invention are indicated in Table 2 by reference to 
the kinase domain.) 

The polypeptide can be isolated from a natural source by methods well-known in 
the art. The natural source may be mammalian, preferably human, blood, semen, or tissue, 

1 5 and the polypeptide may be synthesized using an automated polypeptide synthesizer. The 

isolated, enriched, or purified kinase polypeptide is preferably selected from the group 
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ 
ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ED NO:128, SEQ ID NO:129, SEQ 
ID NO:130, SEQ ID NO:131, SEQ IDNO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ 

20 ID NO:135, SEQ ID NO:136, SEQ ED NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ 

ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ 
ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ 
ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ED NO: 1 53, SEQ ID NO: 1 54, SEQ 
ID NO:l 55, SEQ ID NO: 1 56, SEQ ED NO: 157, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ 

25 ID NO: 160, SEQ ID NO: 161, SEQ DD NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ 

ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ 
ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ 
ID NO: 175, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ED NO:l 79, SEQ 
ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ 

30 ID NO:185, SEQ ID NO:l 86, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ 
ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ED NO: 1 99, SEQ ID NO: 1 93 , SEQ ID NO: 1 94, SEQ 

ID NO:195, SEQ IDNO:196, SEQ ID NO:197, SEQ ID NOT 98, SEQ ID NO:199, SEQ 
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ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ 
ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ 1DN0:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ 
ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ 
5 ID NO:220, SEQ ID NO:221, SEQ ID NO:222 ; SEQ ID NO:223, SEQ ID NO:224, SEQ 

ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ 
ID NO:230, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ 
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ 
ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242A. 

10 In some embodiments the invention includes a recombinant kinase polypeptide 

selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, 
SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33 , SEQ ED NO: 1 34, 
SEQ ED NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ED NO: 138, SEQ ED NO: 139, 

15 SEQ ED NO: 140, SEQ ED NO: 141, SEQ ED NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 

SEQ ED NO: 1 45, SEQ ID NO: 1 46, SEQ ED NO: 1 47, SEQ ED NO: 1 48, SEQ ID NO: 1 49, 
SEQ ED NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ED NO: 1 53 , SEQ ID NO: 1 54, 
SEQ ED NO: 1 55, SEQ ED NO: 1 56, SEQ ID NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, 
SEQ ED NO: 1 60, SEQ ED NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, 

20 SEQ ID NO: 1 65. SEQ ED NO: 1 66, SEQ ID NO:l 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, 

SEQ ED NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ED NO: 173, SEQ ID NO: 174, 
SEQ ID NO:175, SEQ ID NO: 176, SEQ ED NO:177, SEQ ED NO:178, SEQ ID NO: 179, 
SEQ ED NO:l 80, SEQ ID NO: 1 81 , SEQ ID NO:l 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, 
SEQ ED NO: 1 85, SEQ ED NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, 

25 SEQ ED NO: 1 90, SEQ ID NO: 1 91 , SEQ ED NO: 1 99, SEQ ED NO: 1 93, SEQ ID NO: 1 94, 

SEQ ED NO: 1 95, SEQ ED NO: 1 96, SEQ ED NO: 1 97, SEQ ED NO: 1 98, SEQ ED NO: 1 99, 
SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ED NO:203, SEQ ID NO:204, 
SEQ ED NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, 
SEQ ED NO:210, SEQ ED NO:21 1, SEQ ED NO:212, SEQ ID NO:213, SEQ ID NO:214, 

30 SEQ ED NO:2 1 5, SEQ ED NO:2 1 6, SEQ ED NO:2 1 7, SEQ ED NO:2 1 8, SEQ ID NO:2 1 9, 
SEQ ED NO:220, SEQ ED NO:221, SEQ ID NO:222, SEQ ED NO:223, SEQ ID NO:224, 
SEQ ED NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
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SEQ ID NO:230, SEQ ED NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237 5 SEQ ID NO:238, SEQ ID NO:239, 
SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242. By "recombinant kinase 
polypeptide" is meant a polypeptide produced by recombinant DNA techniques such that 
it is distinct from a naturally occurring polypeptide either in its location (e.g., present in a 
different cell or tissue than found in nature), purity or structure. Generally, such a 
recombinant polypeptide will be present in a cell in an amount different from that 
normally observed in nature. 

-In- a -fifth aspect, the invention features an antibody (e.g., a monoclonal or 

polyclonal antibody) having specific binding affinity to a kinase polypeptide or a kinase 
polypeptide domain or fragment where the polypeptide is selected from the group 
consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ 
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ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 
ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241 , and SEQ ED NO:242. In preferred embodiments, the antibody binds 
5 specifically to domains of kinase polypeptides, that are defined supra. 

By "specific binding affinity" is meant that the antibody binds to the target kinase 
polypeptide with greater affinity than it binds to other polypeptides under specified 
conditions. Antibodies or antibody fragments are polypeptides that contain regions that 
can bind other polypeptides. The term "specific binding affinity" describes an antibody 

1 0 that binds to a kinase polypeptide with greater affinity than it binds to other polypeptides 

under specified conditions. 

The term "polyclonal" refers to antibodies that are heterogenous populations of 
antibody molecules derived from the sera of animals immunized with an antigen or an 
antigenic functional derivative thereof For the production of polyclonal antibodies, 

15 various host animals may be immunized by injection with the antigen. Various adjuvants 

may be used to increase the immunological response, depending on the host species. 

"Monoclonal antibodies" are substantially homogenous populations of antibodies 
to a particular antigen. They may be obtained by any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. Monoclonal 

20 antibodies may be obtained by methods known to those skilled in the art (Kohler ei aL, 

Nature 256:495-497, 1975, and U.S. Patent No. 4,376,1 1 0, both of which are hereby 
incorporated by reference herein in their entirety including any figures, tables, or 
drawings). 

The term "antibody fragment" refers to a portion of an antibody, often the hyper 
25 variable region and portions of the surrounding heavy and light chains, that displays 

specific binding affinity for a particular molecule. A hyper variable region is a portion of 
an antibody that physically binds to the polypeptide target. 

Antibodies or antibody fragments having specific binding affinity to a kinase 
polypeptide or domains of a kinase polypeptide of the invention may be used in methods 
30 for detecting the presence and/or amount of kinase polypeptide in a sample by probing the 

sample with the antibody under conditions suitable for kinase-antibody immunocomplex 
formation and detecting the presence and/or amount of the antibody conjugated to the 
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kinase polypeptide. Diagnostic kits for performing such methods may be constructed to 
include antibodies or antibody fragments specific for the kinase as well as a conjugate of a 
binding partner of the antibodies or the antibodies themselves. 

An antibody or antibody fragment with specific binding affinity to a kinase 
polypeptide of the invention can be isolated, enriched, or purified from a prokaryotic or 
eukaryotic organism. Routine methods known to those skilled in the art enable production 
of antibodies or antibody fragments, in both prokaryotic and eukaryotic organisms. 
Purification, enrichment, and isolation of antibodies, which are polypeptide molecules, are 
described above. 

Antibodies having specific binding affinity to a kinase polypeptide of the invention 
may be used in methods for detecting the presence and/or amount of kinase polypeptide in 
a sample by contacting the sample with the antibody under conditions such that an 
immunocomplex forms and detecting the presence and/or amount of the antibody 
conjugated to the kinase polypeptide. Diagnostic kits for performing such methods may 
be constructed to include a first container containing the antibody and a second container 
having a conjugate of a binding partner of the antibody and a label, such as, for example, a 
radioisotope. The diagnostic kit may also include notification of an FDA approved use 
and instructions therefor. 

In a sixth aspect, the invention features a hybridoma which produces an antibody 
having specific binding affinity to a kinase polypeptide or a kinase polypeptide domain, 
where the polypeptide is selected from the group consisting of SEQ ID NO: 122, SEQ ID 
NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID 
NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID 
NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID 
NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID 
NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID 
NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID 
NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID 
NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID 
NO:173, SEQ ID NO:174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 
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NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID 
NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID 
NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID 
NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID 
5 NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 

NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ED NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 

10 NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 

NO:228, SEQ IDNO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242; and 
where the domains are defined as above. By "hybridoma" is meant an immortalized cell 

1 5 line that is capable of secreting an antibody, for example an antibody to a kinase of the 

invention. In preferred embodiments, the antibody to the kinase comprises a sequence of 
amino acids that is able to specifically bind a kinase polypeptide of the invention. 



In a seventh aspect, the invention features a kinase polypeptide binding agent able 
to bind to a kinase polypeptide selected from the group consisting of SEQ ID NO: 122, 
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SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO : 1 9 1 , SEQ ID NO: 1 99, 
SEQ ID NO: 193, SEQ ID NO: 194, SEQ IDNO:195, SEQ ID NO: 196, SEQ ID NO: 197, 
SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, 
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, 
5 SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 

SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, 
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, 
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, 
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, 

10 SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 

SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:24 1 , and SEQ ID 
NO:242. The binding agent is preferably a purified antibody that recognizes an epitope 
present on a kinase polypeptide of the invention. Other binding agents include molecules 
that bind to kinase polypeptides and analogous molecules that bind to a kinase 

1 5 polypeptide. Such binding agents may be identified by using assays that measure kinase 

binding partner activity, such as those that measure PDGFR activity. 

The invention also features a method for screening for human cells containing a 
kinase polypeptide of the invention or an equivalent sequence. The method involves 
identifying the novel polypeptide in human cells using techniques that are routine and 

20 standard in the art, such as those described herein for identifying the kinases of the 

invention (e.g., cloning, Southern or Northern blot analysis, in situ hybridization, PCR 
amplification, etc.). 

In an eighth aspect, the invention features methods for identifying a substance that 
modulates kinase activity comprising the steps of: (a) contacting a kinase polypeptide 

25 selected from the group consisting of SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, 
SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, 
SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, 
SEQ IDNO:135, SEQ IDNO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, 
SEQ ID NO.T40, SEQ ID NO:141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 

30 SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, 
SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, 
SEQ ID NO:l 55, SEQ ED NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, 
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SEQ ID NO: 1 60, SEQ ID NO: 161, SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, 
SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ED NO: 168, SEQ ID NO: 1 69, 
SEQ ID NO: 170, SEQ ED NO: 171, SEQ ED NO: 172, SEQ ED NO: 173, SEQ ED NO: 174, 
SEQ ED NO:175, SEQ ED NO: 176, SEQ ED NO: 177, SEQ ED NO:178, SEQ ED NO: 179, 

5 SEQ ED NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, 

SEQ ED NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ED NO: 1 88, SEQ ID NO: 1 89, 
SEQ ED NO: 1 90, SEQ ED NO: 1 91 , SEQ ED NO: 1 99, SEQ ID NO: 1 93, SEQ ID NO: 1 94, 
SEQ ED NO: 195, SEQ ED NO: 196, SEQ ED NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, 
SEQ ED NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 

10 SEQ ED NO:205, SEQ ED NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 

SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, 
SEQ ID NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ID NO:219, 
SEQ ED NO:220, SEQ ED NO:221 , SEQ ED NO:222, SEQ ED NO:223, SEQ ID NO:224, 
SEQ ED NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, 

15 SEQ ID NO:230, SEQ ED NO:231, SEQ ID NO:232, SEQ ED NO:233, SEQ ID NO:234, 

SEQ ID NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ED N0.239, 
SEQ ED NO:240, SEQ ED NO:24], and SEQ ED NO:242 with a test substance; (b) 
measuring the activity of said polypeptide; and (c) determining whether said substance 
modulates the activity of said polypeptide. 

20 The term "modulates" refers to the ability of a compound to alter the function of a 

kinase of the invention. A modulator preferably activates or inhibits the activity of a 
kinase of the invention. 

The term "activates" refers to increasing the cellular activity of the kinase. The 
term inhibit refers to decreasing the cellular activity of the kinase. Kinase activity is 

25 preferably the interaction with a natural binding partner. 

The term "modulates" also refers to altering the function of kinases of the 
invention by increasing or decreasing the probability that a complex forms between the 
kinase and a natural binding partner. A modulator preferably increases the probability that 
such a complex forms between the kinase and the natural binding partner, more preferably 

30 increases or decreases the probability that a complex forms between the kinase and the 

natural binding partner depending on the concentration of the compound exposed to the 
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kinase, and most preferably decreases the probability that a complex forms between the 
kinase and the natural binding partner. 

The term "complex" refers to an assembly of at least two molecules bound to one 
another. Signal transduction complexes often contain at least two protein molecules 
5 bound to one another. For instance, a protein tyrosine receptor protein kinase, GRB2, 

SOS, RAF, and RAS assemble to form a signal transduction complex in response to a 
mitogenic ligand. 

The term "natural binding partner" refers to polypeptides, lipids, small molecules, 
or nucleic acids that bind to kinases in cells. A change in the interaction between a kinase 

10 and a natural binding partner can manifest itself as an increased or decreased probability 

that the interaction forms, or an increased or decreased concentration of kinase/natural 
binding partner complex. 

The term "contacting" as used herein refers to mixing a solution comprising the 
test compound with a liquid medium bathing the cells of the methods. The solution 

15 comprising the compound may also comprise another component, such as dimethyl 

sulfoxide (DMSO), which facilitates the uptake of the test compound or compounds into 
the cells of the methods. The solution comprising the test compound may be added to the 
medium bathing the cells by utilizing a delivery apparatus, such as a pipet-based device or 
syringe-based device. 

20 In a ninth aspect, the invention features methods for identifying a substance that 

modulates kinase activity in a cell comprising the steps of: (a) expressing a kinase 
polypeptide in a cell, wherein said polypeptide is selected from the group consisting of 
SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25 , SEQ ID NO: 1 26, 
SEQ ID NO: 127, SEQ ID NO: 128, SEQ ED NO: 129, SEQ ID NO: 130, SEQ ID NO:131, 

25 SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO:136, 

SEQ ED NO: 1 37, SEQ ID NO: 1 3 8, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , 
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, 
SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , 
SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, 

30 SEQ ID NO:l 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , 
SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, 
SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO:171, 
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15 and SEQ ID NO:242; (b) adding a test substance to said cell; and (c) monitoring a change 

in cell phenotype or the interaction between said polypeptide and a natural binding 
partner. 

The term "expressing" as used herein refers to the production of kinases of the 
invention from a nucleic acid vector containing kinase genes within a cell. The nucleic 
20 acid vector is transfected into cells using well known techniques in the art as described 

herein. 

In a tenth aspect, the invention provides methods for treating a disease or abnormal 
condition by administering to a patient in need of such treatment a substance that 
modulates the activity of a polypeptide selected from the group consisting of SEQ ID 

25 NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 

NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 

30 NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 

NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 
NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID 
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NO: 162, SEQ ID NO: 163, SEQ ID NO:l 64, SEQ ID NO: 165. SEQ ED NO: 166, SEQ ID 
NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID 
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ID 
5 NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 
NO: 187, SEQ ID NO:l 88, SEQ ID NO:l 89, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

10 NO:207, SEQ IDNO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ID NO:21 3, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 

1 5 NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 

NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242. Preferably, the disease is selected from the group consisting of immune- 
related diseases and disorders, cardiovascular disease, neurodegenerative disorders, and 
cancer. Also included are metabolic disorders, such as diabetes mellitus, and reproductive 

20 disorders, such as infertility. 

Preferably, the disease or disorder is selected from the group consisting of 
rheumatoid arthritis, artherosclerosis, autoimmune disorders, and organ transplantation. 
Preferably the disease or disorder is selected from the group consisting of immune-related 
diseases and disorders, myocardial infarction, cardiomyopathies, stroke, renal failure, and 

25 oxidative stress-related neurodegenerative disorders. Most preferably, the immune-related 

diseases and disorders are selected from the group consisting of rheumatoid arthritis, 
chronic inflammatory bowel disease, chronic inflammatory pelvic disease, multiple 
sclerosis, asthma, osteoarthritis, psoriasis, atherosclerosis, rhinitis, autoimmunity, and 
organ transplantation. 

30 Substances useful for treatment of disorders or diseases preferably show positive 

results in one or more in vitro assays for an activity corresponding to treatment of the 
disease or disorder in question Substances that modulate the activity of the polypeptides 
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preferably include, but are not limited to, antisense oligonucleotides and inhibitors of 
protein kinases. 

The term 4 preventing" refers to decreasing the probability that an organism 
contracts or develops an abnormal condition. 
5 The term "treating" refers to having a therapeutic effect and at least partially 

alleviating or abrogating an abnormal condition in the organism. 

The tenn "therapeutic effect" refers to the inhibition or activation factors causing 
or contributing to the abnormal condition. A therapeutic effect relieves to some extent one 
or more of the symptoms of the abnormal condition. In reference to the treatment of 
10 abnormal conditions, a therapeutic effect can refer to one or more of the following: (a) an 

increase in the proliferation, growth, and/or differentiation of cells; (b) inhibition (i.e., 
slowing or stopping) of cell death; (c) inhibition of degeneration; (d) relieving to some 
extent one or more of the symptoms associated with the abnormal condition; and (e) 
enhancing the function of the affected population of cells. Compounds demonstrating 
15 efficacy against abnormal conditions can be identified as described herein. 

The term "abnormal condition" refers to a function in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
condition can relate to cell proliferation, cell differentiation or cell survival. An abnormal 
condition may also include irregularities in cell cycle progression, i.e., irregularities in 
20 normal cell cycle progression through mitosis and meiosis. 

Abnormal cell proliferative conditions include cancers such as fibrotic and 
mesangial disorders, abnormal angiogenesis and vasculogenesis, wound healing, psoriasis, 
diabetes mellitus, and inflammation. 

Abnormal differentiation conditions include, but are not limited to 
25 neurodegenerative disorders, slow wound healing rates, and slow tissue grafting healing 

rates. 

Abnormal cell survival conditions relate to conditions in which programmed cell 
death (apoptosis) pathways are activated or abrogated. A number of protein kinases are 
associated with the apoptosis pathways. Aberrations in the function of any one of the 
30 protein kinases could lead to cell immortality or premature cell death. 



WO 00/73469 



PCTAJS00/14842 



The term "aberration", in conjunction with the function of a kinase in a signal 
transduction process, refers to a kinase that is over- or under-expressed in an organism, 
mutated such that its catalytic activity is lower or higher than wild-type protein kinase 
activity, mutated such that it can no longer interact with a natural binding partner, is no 
5 longer modified by another protein kinase or protein phosphatase, or no longer interacts 

with a natural binding partner. 

The term "administering" relates to a method of incorporating a compound into 
cells or tissues of an organism. The abnormal condition can be prevented or treated when 
the cells or tissues of the organism exist within the organism or outside of the organism. 

10 Cells existing outside the organism can be maintained or grown in cell culture dishes. For 

cells harbored within the organism, many techniques exist in the art to administer 
compounds, including (but not limited to) oral, parenteral, dermal, injection, and aerosol 
applications. For cells outside of the organism, multiple techniques exist in the art to 
administer the compounds, including (but not limited to) cell microinjection techniques, 

15 transformation techniques, and carrier techniques. 

The abnormal condition can also be prevented or treated by administering a 
compound to a group of cells having an aberration in a signal transduction pathway to an 
organism. The effect of administering a compound on organism function can then be 
monitored. The organism is preferably a mouse, rat, rabbit, guinea pig, or goat, more 

20 preferably a monkey or ape, and most preferably a human. 

In an eleventh aspect, the invention features methods for detection the expression 
of a polypeptide in a sample as a diagnostic tool for diseases or disorders, wherein the 
method comprises the steps of: (a) contacting the sample with a nucleic acid probe which 
hybridizes under hybridization assay conditions to a nucleic acid target region of a kinase 

25 polypeptide selected from the group consisting of SEQ ED NO:122, SEQ ID NO:123, SEQ 

ID NO:124, SEQ ID NO:125, SEQ ID NO: 126, SEQ ID NO:127, SEQ ID NO.T28, SEQ 
ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ 
ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ 
ID NO:139, SEQ ED NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ 

30 ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ 
ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ 
ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ 
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ID NO: 159, SEQ ED NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ 
ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ 
ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ 
ED NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO:178, SEQ 
5 ID NO: 179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO: 182, SEQ ID NO:183, SEQ 

ID NO:184, SEQ ID NO:l 85, SEQ ID NO:l 86, SEQ ID NO:l 87, SEQ ID NO:l 88, SEQ 
ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ED NO: 1 99, SEQ ID NO: 1 93, SEQ 
ID NO:194, SEQ ED NO:195, SEQ ED NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ 
ID NO:199, SEQ ED NO:200, SEQ ED NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ 

1 0 ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ 
ED NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ 
ID NO:214, SEQ ID NO:215, SEQ ED NO:216, SEQ ID NO:217, SEQ ED NO:218, SEQ 
ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 
ED NO:224, SEQ ED NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ 

15 ID NO:229, SEQ ED NO:230, SEQ ED NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ 
ID NO:234, SEQ ID NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ 
ED NO:239, SEQ ED NO:240, SEQ ED NO:241, and SEQ ID NO:242, said probe 
comprising the nucleic acid sequence encoding the polypeptide, fragments thereof, and the 
complements of the sequences and fragments; and (b) detecting the presence or amount of 

20 the probe:target region hybrid as an indication of the disease. 

In preferred embodiments of the invention, the disease or disorder is selected from 
the group consisting of rheumatoid arthritis, artherosclerosis, autoimmune disorders, organ 
transplantation, myocardial infarction, cardiomyopathies, stroke, renal failure, oxidative 
stress-related neurodegenerative disorders, metabolic disorder including diabetes, 

25 reproductive disorders including infertility, and cancer. 

The kinase "target region" is a nucleotide base sequence selected from the group 
consisting of those set forth in SEQ ID NO:l , SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:5, SEQ ED NO:6, SEQ ID NO:7, SEQ ID N0.8, SEQ ID NO:9, SEQ 
ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ED NO:13, SEQ ID NO:14, SEQ ID 

30 NO: 15, SEQ ED NO: 16, SEQ ED NO: 17, SEQ ID NO: 18, SEQ ED NO: 19, SEQ ED NO:20, 
SEQ ID NO:21 , SEQ ED NO:22, SEQ ED N0.23, SEQ ED NO:24, SEQ ID NO:25, SEQ 
ED NO:26, SEQ ID NO:27, SEQ ED NO:28, SEQ ED NO:29, SEQ ID NO:30, SEQ ED 
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NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ED NO:34, SEQ ID NO:35, SEQ ID NO:36, 
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID N0:41, SEQ 
ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID N0:51 , SEQ ID NO:52, 
5 SEQ ED NO:53, SEQ ED NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ 
ED NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ED N0:61 , SEQ ID NO:62, SEQ ID 
NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ED NO:67, SEQ ID NO:68, 
SEQ ID NO:69, SEQ ED NO:70, SEQ ID N0:71, SEQ ID NO:72, SEQ ED NO:73, SEQ 
ID NO:74, SEQ ED NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID 

10 NO:79, SEQ ED NO:80, SEQ ID N0:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, 
SEQ ID NO:85, SEQ ED NO:86, SEQ ID NO:87, SEQ ED NO:88, SEQ ID NO:89, SEQ 
ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID 
NO:95, SEQ ED NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID 
NO: 1 00, SEQ ID NO: 1 01 , SEQ ED NO: 1 02, SEQ ID NO: 1 03, SEQ ID NO: 1 04, SEQ ID 

15 NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, SEQ ID 

NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 11 4, SEQ ID 
NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID NO: 11 9, SEQ ID 
NO: 120, and SEQ ED NO: 121 , or the corresponding full-length sequences, a functional 
derivative thereof, or a fragment thereof to which the nucleic acid probe will specifically 

20 hybridize. Specific hybridization indicates that in the presence of other nucleic acids the 

probe only hybridizes detectably with the kinase of the invention's target region. Putative 
target regions can be identified by methods well known in the art consisting of alignment 
and comparison of the most closely related sequences in the database. 

In preferred embodiments the nucleic acid probe hybridizes to a kinase target 

25 region encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous 
amino acids of the sequence set forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ED 
NO: 124, SEQ ID NO: 125, SEQ ED NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID 
NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID 
NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID 

30 NO: 139, SEQ ID NO: 140, SEQ ID NO.141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID 
NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID 
NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID 
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NO: 154, SEQ ID NO:155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID 
NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID 
NO: 164, SEQ ID NO:165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID 
NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 71 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID 

5 NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID 
NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID 
NO:l 84, SEQ ID NO:l 85, SEQ ID NO:l 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID 
NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 99, SEQ ID NO: 1 93, SEQ ID 
NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID 

10 NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID 
NO:209, SEQ ID NO:21 0, SEQ ID NO:21 1 , SEQ ID NO:2 12, SEQ ID NO:21 3, SEQ ID 
NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ IDNO:218, SEQ ID 
NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID 

1 5 NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ED NO:228, SEQ ID 
NO:229, SEQ ED NO:230, SEQ ED NO:23 1 , SEQ ED NO:232, SEQ ED NO:233, SEQ ID 
NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID NO:242, or the corresponding 
full-length amino acid sequence, or a functional derivative thereof. Hybridization 

20 conditions should be such that hybridization occurs only with the kinase genes in the 

presence of other nucleic acid molecules. Under stringent hybridization conditions only 
highly complementary nucleic acid sequences hybridize. Preferably, such conditions 
prevent hybridization of nucleic acids having more than 1 or 2 mismatches out of 20 
contiguous nucleotides. Such conditions are defined supra. 

25 Hybridization conditions should be such that hybridization occurs only with the 

genes in the presence of other nucleic acid molecules. Under stringent hybridization 
conditions only highly complementary nucleic acid sequences hybridize. Preferably, such 
conditions prevent hybridization of nucleic acids having 1 or 2 mismatches out of 20 
contiguous nucleotides. Such conditions are defined supra. 

30 The diseases for which detection of kinase genes in a sample could be diagnostic 

include diseases in which kinase nucleic acid (DNA and/or RNA) is amplified in 
comparison to normal cells. By "amplification" is meant increased numbers of kinase 
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DNA or RNA in a cell compared with normal cells. In normal cells, kinases are typically 
found as single copy genes. In selected diseases, the chromosomal location of the kinase 
genes may be amplified, resulting in multiple copies of the gene, or amplification. Gene 
amplification can lead to amplification of kinase RNA, or kinase RNA can be amplified in 
5 the absence of kinase DNA amplification. 

"Amplification" as it refers to RNA can be the detectable presence of kinase RNA 
in cells, since in some normal cells there is no basal expression of kinase RNA. In other 
normal cells, a basal level of expression of kinase exists, therefore in these cases 
amplification is the detection of at least 1-2-fold, and preferably more, kinase RNA, 

1 0 compared to the basal level. 

The diseases that could be diagnosed by detection of kinase nucleic acid in a 
sample preferably include cancers. The test samples suitable for nucleic acid probing 
methods of the present invention include, for example, cells or nucleic acid extracts of 
cells, or biological fluids. The samples used in the above-described methods will vary 

1 5 based on the assay format, the detection method and the nature of the tissues, cells or 

extracts to be assayed. Methods for preparing nucleic acid extracts of cells are well 
known in the art and can be readily adapted in order to obtain a sample that is compatible 
with the method utilized. 

Another aspect of the invention involves a method of agonizing (stimulating) or 

20 antagonizing a target of the invention and a natural binding partner associated activity in a 

mammal comprising administering to said mammal an agonist or antagonist to one of the 
above disclosed polypeptides in an amount sufficient to effect said agonism or 
antagonism. A method of treating diseases in a mammal with an agonist or antagonist of 
the protein of the present invention activity comprising administering the agonist or 

25 antagonist to a mammal in an amount sufficient to agonize or antagonize associated 

functions is also encompassed in the present application. 

In an effort to discover novel treatments for diseases, biomedical researchers and 
chemists have designed, synthesized, and tested molecules that inhibit the function of 
protein polypeptides. Some small organic molecules form a class of compounds that 

30 modulate the function of protein polypeptides. Examples of molecules that have been 
reported to inhibit the function of protein kinases include, but are not limited to, bis 
monocyclic, bicyclic or heterocyclic aryl compounds (PCT WO 92/20642, published 
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November 26, 1992 by Maguire et al.), vinylene-azaindole derivatives (PCT WO 
94/14808, published July 7, 1994 by Ballinari et al), l-cyclopropyl-4-pyridyl-quinolones 
(U.S. Patent No. 5,330,992), styry] compounds (U.S. Patent No. 5,217,999), styryl- 
substituted pyridyl compounds (U.S. Patent No. 5,302,606), certain quinazoline 
5 derivatives (EP Application No. 0 566 266 Al), seleoindoles and selenides (PCT WO 
94/03427, published February 17, 1994 by Denny et al.), tricyclic polyhydroxylic 
compounds (PCT WO 92/21660, published December 10, 1992 by Dow), and 
benzylphosphonic acid compounds (PCT WO 91/15495, published October 17, 1991 by 
Dow et al), all of which are incorporated by reference herein, including any drawings. 

10 Compounds that can traverse cell membranes and are resistant to acid hydrolysis 

are potentially advantageous as therapeutics as they can become highly bioavailable after 
being administered orally to patients. However, many of these protein inhibitors only 
weakly inhibit function. In addition, many inhibit a variety of protein kinases and will 
therefore cause multiple side-effects as therapeutics for diseases. 

15 Some indolinone compounds, however, form classes of acid resistant and 

membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by 
Ballinari el al.) describes hydrosoluble indolinone compounds that harbor tetralin, 
naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic 
substituents are in turn substituted with polar groups including hydroxylated alky], 

20 phosphate, and ether substituents. U.S. Patent Application Serial Nos. 08/702,232, filed 

August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and 
Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 
221/187) and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline 
Compounds for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 

25 223/298) and International Patent Publication WO 96/22976, published August 1 , 1 996 by 
Ballinari et al., all of which are incorporated herein by reference in their entirety, 
including any drawings, describe indolinone chemical libraries of indolinone compounds 
harboring other bicyclic moieties as well as monocyclic moieties fused to the oxindole 
nng. Applications 08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial 

30 Libraries and Related Products and Methods for the Treatment of Disease" by Tang et al. 
(Lyon & Lyon Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled 
"Benzylidene-Z-Indoline Compounds for the Treatment of Disease" by Tang et al. (Lyon 



WO 00/73469 



PCT/US00/14842 



& Lyon Docket No. 223/298), and WO 96/22976, published August 1, 1996 by Ballinari 
et al. teach methods of indolinone synthesis, methods of testing the biological activity of 
indolinone compounds in cells, and inhibition patterns of indolinone derivatives, both of 
which are incorporated by reference herein, including any drawings. 
5 Other examples of substances capable of modulating kinase activity include, but 

are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The 
quinazolines, tyrphostins, quinolines, and quinoxolines referred to above include well 
known compounds such as those described in the literature. For example, representative 
publications describing quinazolines include Barker et al., EPO Publication No. 0 520 722 

10 Al; Jones et al., U.S. Patent No. 4,447,608; Kabbe et al., U.S. Patent No. 4,757,072; Kaul 

and Vougioukas, U.S. Patent No. 5, 316,553; Kreighbaum and Comer, U.S. Patent No. 
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 Al ; Barker et al., Proc. 
of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research 3:293- 
304 (1979); Bertino, J.R., Cancer Research 9(2 part l):293-304 (1979); Curtin et al., Br. J. 

15 Cancer 53:361-368 (1986); Fernandes et al., Cancer Research 43:1 1 1 7-1 123 (1983); Ferris 

et al. J. Org. Chem. 44(2): 173-1 78; Fry et al., Science 265:1093-1095 (1994); Jackman et 
al., Cancer Research 5 1 :5579-5586 (1981); Jones et al. J. Med. Chem. 29(6): 1114-1118; 
Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al., J. Org. Chem. 
54:3511-3518(1989); Ley and Seng, Synthesis 1975:415-522 (1975); Maxwell et al., 

20 Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al., Cancer Research 

45:325-330(1985); Phillips and Castle, J. Heterocyclic Chem. 17(19):1489-1 596 (1980); 
Reece et al., Cancer Research 47(1 1):2996-2999 (1977); Sculier et al., Cancer Immunol. 
and lmmunother. 23:A65 (1986); Sikora et al., Cancer Letters 23:289-295 (1984); and 
Sikora et al., Analytical Biochem. 172:344-355 (1988), all of which are incorporated 

25 herein by reference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incoiporated herein by reference in its entirety, including any drawings. 

Quinolines are described in Dolle et al., J. Med. Chem. 37:2627-2629 (1994); 
MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et al., J. Med. Chem. 36:425-432 

30 (1993); and Burke et al. BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which 

are incorporated by reference in their entirety, including any drawings. 
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Tyiphostins are described in Allen et al., Clin. Exp. Immunol. 91:141-156 (1993); 
Anafi et al., Blood 82:12:3524-3529 (1993); Baker et al., J. Cell Sci. 102:543-555 (1992); 
Bilder et al., Amer. Phvsiol. Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al., 
Proceedings of Airier. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al., Experimental 

5 Cell Research 199:255-261 (1992); Dong et al., J. Leukocyte Biology 53:53-60 (1993); 

Dong et al., J. Immunol. 151(5):2717-2724 (1993); Gazit et al., J. Med. Chem. 32:2344- 
2352 (1989); Gazit et al., " J. Med. Chem. 36:3556-3564 (1993); Kaur et al., Anti-Cancer 
Drugs 5:213-222 (1994); Kaur et al., King et al., Biochem. J. 275:413-418 (1991); Kuo et 
al.. Cancer Letters 74:197-202 (1993); Levitzki, A.. The FASEB J. 6:3275-3282 (1992); 

10 Lyall et al., J. Biol. Chem. 264:14503-14509 (1989); Peterson et al., The Prostate 22:335- 

345 (1993); Pillemer et al., Int. J. Cancer 50:80-85 (1992); Posner et al., Molecular 
Pharmacology 45:673-683 (1993); Rendu et al., Biol. Pharmacology 44(5):881-888 
(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, J. 
Pharm. and Experimental Therapeutics 267(3): 1 19-1 125 (1993); Wolbring et al., LBioL 

15 Chem. 269(36):22470-22472 (1994); and Yoneda et al., Cancer Research 51:4430-4435 

(1991); all of which are incorporated herein by reference in their entirety, including any 
drawings. 

Other compounds that could be used as modulators include oxindolinones such as 
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1 996, 

20 incorporated herein by reference in its entirety, including any drawings. 

Methods of Treating a Disease (Enab lement - i.e.. Dosing) 

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 

25 WO 96/22976, published August 1 1 996, both of which are incorporated herein by 

reference in their entirety, including any drawings, figures or tables. Those skilled in the 
art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
30 treated, the particular composition being used and the size and physiological condition of 
the patient. Therapeutically effective doses for the compounds described herein can be 
estimated initially from cell culture and animal models. For example, a dose can be 
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formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

Plasma half-life and biodistribution of the drug and metabolites in the plasma, 
5 tumors and major organs can also be determined to facilitate the selection of drugs most 

appropriate to inhibit a disorder. Such measurements can be carried out. For example, 
HPLC analysis can be performed on the plasma of animals treated with the drug and the 
location of radiolabeled compounds can be deter-mined using detection methods such as 
X-ray, CAT scan and MRI. Compounds that show potent inhibitory activity in the 

10 screening assays, but have poor pharmacokinetic characteristics, can be optimized by 

altering the chemical structure and retesting. In this regard, compounds displaying good 
pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 

15 1) the compound is administered to mice (an untreated control mouse should also be 

used); 2) blood samples are periodically obtained via the tail vein from one mouse in each 
treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition and the percent of lymphocytes versus polymorphonuclear cells. A 
comparison of results for each dosing regime with the controls indicates if toxicity is 

20 present. 

At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary Medical 
Association guidelines Report of the American Veterinary Medical Assoc. Panel on 
Euthanasia, Journal of American Veterinary Medical Assoc. , 202:229-249, 1993). 

25 Representative animals from each treatment group can then be examined by gross 

necropsy for immediate evidence of metastasis, unusual illness or toxicity. Gross 
abnormalities in tissue are noted and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse 

30 effect the less preferred the compound. 
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For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most 
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 
5 Plasma levels should reflect the potency of the drug. Generally, the more potent 

the compound the lower the plasma levels necessary to achieve efficacy. 

In a final aspect, the invention features a method for detection of a kinase 
polypeptide in a sample as a diagnostic tool for a disease or disorder, wherein the method 
comprises: (a) comparing a nucleic acid target region encoding the kinase polypeptide in 
10 a sample, where the kinase polypeptide is selected from the group consisting of SEQ ID 
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NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242, or one or more fragments thereof, with a control nucleic acid target region 
encoding the kinase polypeptide, or one or more fragments thereof; and (b) detecting 

5 differences in sequence or amount between the target region and the control target region, 

as an indication of the disease or disorder. Preferably, the disease or disorder is selected 
from the group consisting of immune-related diseases and disorders, organ transplantation, 
myocardial infarction, cardiovascular disease, stroke, renal failure, oxidative stress-related 
neurodegenerative disorders, and cancer. Immune-related diseases and disorders include, 

1 0 but are not limited to, those discussed previously. 

The term "comparing" as used herein refers to identifying discrepancies between 
the nucleic acid target region isolated from a sample, and the control nucleic acid target 
region. The discrepancies can be in the nucleotide sequences; e.g. insertions, deletions, or 
point mutations, or in the amount of a given nucleotide sequence. Methods to determine 

15 these discrepancies in sequences are well-known to one of ordinary skill in the art. The 

"control" nucleic acid target region refers to the sequence or amount of the sequence 
found in normal cells, e.g. cells that are not diseased as discussed previously. 
The term also includes anti-sense molecules drawn thereto. 
The invention has been described broadly and generically herein. Each of the 

20 narrower species and subgeneric groupings falling within the generic disclosure also form 

part of the invention. This includes the generic description of the invention with a proviso 
or negative limitation removing any subject matter from the genus, regardless of whether 
or not the excised material is specifically recited herein. For example, in some instances 
the nucleotide sequence of particular kinase polypeptides may not be part of a preferred 

25 embodiment. 

The summary of the invention described above is not limiting and other features 
and advantages of the invention will be apparent from the following detailed description of 
the invention, and from the claims. 



WO 00/73469 



PCT/US00/14842 



73 

BRIEF DESCRIPTION OF THE FIGURES 
Figures 1 A to IBB shows the amino acid sequences of SEQ ID NO: 122, SEQ ID 
NO: 123, SEQ ID NO: 124, SEQ ID NO:125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID 
NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID 
NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID 
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID 
NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID 
NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 5 5, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID 
NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID 
NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID 
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID 
NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID 
NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 
NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID 
NO:188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID 
NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID 
NO:198, SEQ IDNO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ IDNO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. 

Figures 2A to 2MMMM shows the nucleic acid sequences of SEQ ID NO: 1 , SEQ 
ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, 
SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12, SEQ ID 
NO: 1 3, SEQ ID NO: 1 4, SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, 
SEQ ID NO:l 9, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ 
ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID 
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NO:29, SEQ ID NO:30, SEQ ID N0:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ED NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ 
ID NO:40, SEQ ID N0:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, 
SEQ ID N0:51 , SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ 
ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID 
N0:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, 
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71 , SEQ 
ED NO:72, SEQ ED NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID 
NO:77, SEQ ED NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID N0.81, SEQ ID NO:82, 
SEQ ED NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ 
ED NO:88, SEQ ED NO:89, SEQ ED NO:90, SEQ ID N0:91, SEQ ID NO:92, SEQ ID 
NO:93, SEQ ED NO:94, SEQ ED NO:95, SEQ ED NO:96, SEQ ID NO:97, SEQ ID NO:98, 
SEQ ID NO:99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, 
SEQ ED NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ED NO: 108, 
SEQ ID NO: 1 09, SEQ ED NO: 1 1 0, SEQ ED N0:1 1 1 , SEQ ED NO: 1 1 2, SEQ ID NO: 1 1 3, 
SEQ ED N0:1 14, SEQ ED N0:1 15, SEQ ID N0:1 16, SEQ ED NO:l 17, SEQ ED N0:1 18, 
SEQ ID N0:1 19, SEQ ID NO: 120, and SEQ ID NO: 121. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates in part to kinase polypeptides, nucleic acids encoding 
such polypeptides, cells containing such nucleic acids, antibodies to such polypeptides, 
assays utilizing such polypeptides, and methods relating to all of the foregoing. The 
present invention is based upon the isolation and characterization of new kinase 
polypeptides. The polypeptides and nucleic acids may be produced using well-known and 
standard synthesis techniques when given the sequences presented herein. 

I. The Nucleic Acids of the Invention 

Included within the scope of this invention are the functional equivalents of the 
herein-described isolated nucleic acid molecules. The degeneracy of the genetic code 
permits substitution of certain codons by other codons that specify the same amino acid 
and hence would give rise to the same protein. The nucleic acid sequence can vary 
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substantially since, with the exception of methionine and tryptophan, the known amino 
acids can be coded for by more than one codon. Thus, portions or all of the kinase genes 
of the invention could be synthesized to give a nucleic acid sequence significantly 
different from one selected from the group consisting of those set forth in SEQ ID NO:l, 
5 SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ED NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1 , SEQ ID NO: 12, 
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ED NO:17, SEQ 
ID NO: 18, SEQ ID NO: 19, SEQ DD NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID 
NO:23, SEQ ID NO:24, SEQ DD NO:25, SEQ ED NO:26, SEQ DD NO:27, SEQ DD NO:28, 

10 SEQ DD NO:29, SEQ DD NO:30, SEQ DD NO:31, SEQ DD NO:32, SEQ DD NO:33, SEQ 
DD NO:34, SEQ DD NO:35, SEQ DD NO:36, SEQ DD NO:37, SEQ DD NO:38, SEQ DD 
NO:39, SEQ DD NO:40, SEQ DD NO:41 , SEQ ED NO:42, SEQ DD NO:43, SEQ ID NO:44, 
SEQ DD NO:45, SEQ DD NO:46, SEQ DD NO:47, SEQ DD NO:48, SEQ ID NO:49, SEQ 
DD NO:50, SEQ DD NO:51, SEQ DD NO:52, SEQ DD NO:53, SEQ ID NO:54, SEQ ID 

15 NO:55, SEQ ID NO:56, SEQ DD NO:57, SEQ ID NO:58, SEQ ED NO:59, SEQ DD NO:60, 

SEQ ED NO:61, SEQ DD NO:62, SEQ DD NO:63, SEQ DD NO:64, SEQ ID NO:65, SEQ 
ID NO:66, SEQ DD NO:67, SEQ ID NO:68, SEQ DD NO:69, SEQ ID NO:70, SEQ ID 
NO:71 , SEQ ED NO:72, SEQ DD NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, 
SEQ DD NO:77, SEQ DD NO:78, SEQ DD NO:79, SEQ DD NO:80, SEQ DD NO:8 1 , SEQ 

20 ID NO:82, SEQ DD NO:83, SEQ DD NO:84, SEQ DD NO:85, SEQ DD NO:86, SEQ ID 

NO:87, SEQ DD NO:88, SEQ DD NO:89, SEQ ID NO:90, SEQ DD NO:91 , SEQ ID NO:92, 
SEQ DD NO:93, SEQ DD NO:94, SEQ ID NO:95, SEQ DD NO:96, SEQ DD NO:97, SEQ 
ID NO:98, SEQ DD NO:99, SEQ DD NO:100, SEQ DD NO:101, SEQ DD NO:102, SEQ DD 
NO: 1 03, SEQ ED NO: 1 04, SEQ DD NO: 1 05, SEQ DD NO: 1 06, SEQ DD NO: 1 07, SEQ DD 

25 NO: 1 08, SEQ ID NO: 1 09, SEQ DD NO: 1 1 0, SEQ DD NO: 1 1 1 , SEQ DD NO: 1 1 2, SEQ DD 
NO: 1 1 3, SEQ DD NO: 1 1 4, SEQ ED NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID 
NO: 11 8, SEQ ID NO: 1 1 9, SEQ ID NO: 1 20, and SEQ ID NO: 1 2 1 . The encoded amino 
acid sequence thereof would, however, be preserved. 

In addition, the nucleic acid sequence may comprise a nucleotide sequence which 

30 results from the addition, deletion or substitution of at least one nucleotide to the 5 '-end 
and/or the 3 '-end of the nucleic acid sequence shown in SEQ DD NO:l, SEQ ED NO:2, 
SEQ DD NO:3, SEQ DD NO:4, SEQ ID NO:5, SEQ DD NO:6, SEQ ID NO:7, SEQ ID 



WO 00/73469 



76 



PCT/US00/14842 



NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO. l 1, SEQ ID NO: 12, SEQ ID NO: 13, 
SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ 
ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ED NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, 
5 SEQ ID NO:30, SEQ ID NO:3 1, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ 
ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID 
NO:40, SEQ ID NO:41 , SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, 
SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ 
ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ED NO:54, SEQ ID NO:55, SEQ ID 

10 NO:56, SEQ ED NO:57, SEQ ED NO:58, SEQ ED NO:59, SEQ ID NO:60, SEQ ID NO:61 , 
SEQ ED NO:62, SEQ ED NO:63, SEQ ED NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ 
ED NO:67, SEQ ED NO:68, SEQ ED NO:69, SEQ ED NO:70, SEQ ED N0:71, SEQ ED 
NO:72, SEQ ED NO:73, SEQ ED NO:74, SEQ ED NO:75, SEQ ED NO:76, SEQ ID NO:77, 
SEQ ED NO:78, SEQ ED NO:79, SEQ ED NO:80, SEQ ED N0:81, SEQ ED NO:82, SEQ 

15 ED NO:83, SEQ ED NO:84, SEQ ED NO:85, SEQ ED NO:86, SEQ ED NO:87, SEQ ID 

NO:88, SEQ ED NO:89, SEQ ED NO:90, SEQ ED N0:91 , SEQ ED NO:92, SEQ ED NO:93, 
SEQ ED NO:94, SEQ ED NO:95, SEQ ED NO:96, SEQ ED NO:97, SEQ ID NO:98, SEQ 
ID NO:99, SEQ ED NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID 
NO: 104, SEQ ED NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID 

20 NO: 109, SEQ ED NO:l 10, SEQ ED NO: 11 1, SEQ ID NO:l 12, SEQ ID NO:l 13, SEQ ID 
NO: 1 1 4, SEQ ED NO: 1 1 5, SEQ ED NO: 1 1 6, SEQ ED NO: 1 1 7, SEQ ED NO: 1 1 8, SEQ ID 
NO:l 19, SEQ ED NO: 120, and SEQ ID NO: 121, or a derivative thereof. Any nucleotide 
or polynucleotide may be used in this regard, provided that its addition, deletion or 
substitution does not alter the amino acid sequence of SEQ ED NO: 122, SEQ ID NO: 123, 

25 SEQ ED NO: 1 24, SEQ ID NO: 1 25, SEQ ED NO: 1 26, SEQ ED NO: 1 27, SEQ ID NO: 1 28, 
SEQ ED NO:129, SEQ ED NO:130, SEQ ED NO:131, SEQ ED NO:132, SEQ ED NO:133, 
SEQ ID NO: 134, SEQ ED NO: 135, SEQ ED NO: 136, SEQ ED NO: 137, SEQ ED NO: 138, 
SEQ ED NO: 139, SEQ ED NO: 140, SEQ ED NO: 141, SEQ ED NO: 142, SEQ ED NO: 143, 
SEQ ED NO:144, SEQ ED NO:145, SEQ ED NO:146, SEQ ED NO:147, SEQ ED NO:148, 

30 SEQ ID NO: 149, SEQ ED NO: 150, SEQ ED NO:151, SEQ ED NO: 152, SEQ ID NO: 153, 
SEQ ID NO: 1 54, SEQ ED NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, 
SEQ ID NO:159, SEQ ED NO:160, SEQ ED NO:161, SEQ ED NO:162, SEQ ID NO:163, 
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SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, that is encoded 
by the nucleotide sequence. For example, the present invention is intended to include any 
nucleic acid sequence resulting from the addition of ATG as an initiation codon at the 5'- 
end of the inventive nucleic acid sequence or its derivative, or from the addition of TTA, 
20 TAG or TGA as a termination codon at the 3'-end of the inventive nucleotide sequence or 

its derivative. Moreover, the nucleic acid molecule of the present invention may, as 
necessary, have restriction endonuclease recognition sites added to its 5'-end and/or 3'- 
end. 

Such functional alterations of a given nucleic acid sequence afford an opportunity 
25 to promote secretion and/or processing of heterologous proteins encoded by foreign 

nucleic acid sequences fused thereto, for example. AH variations of the nucleotide 
sequence of the kinase genes of the invention and fragments thereof permitted by the 
genetic code are, therefore, included in this invention. 

Further, it is possible to delete codons or to substitute one or more codons with 
30 codons other than degenerate codons to produce a structurally modified polypeptide, but 
one which has substantially the same utility or activity as the polypeptide produced by the 
unmodified nucleic acid molecule. As recognized in the art, the two polypeptides are 
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functionally equivalent, as are the two nucleic acid molecules that give rise to their 
production, even though the differences between the nucleic acid molecules are not related 
to the degeneracy of the genetic code. This is discussed further in the "Functional 
Derivatives" section, herein. 
5 Finally, many of the nucleic acid molecules of the invention are provided as a 

partial sequence only (Fig. 2 A through 2QQ). However, it is standard for one of ordinary 
skill in the art to obtain a full-length sequence when provided with a partial sequence. 
Similarly, when provided with a partial or full-length sequence it is standard for one of 
ordinary skill in the art to obtain nucleic acid sequence coding for homologous proteins. 

10 Therefore, these nucleic acid molecules are also part of the invention. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1. The protein kinases fall into 10 known groups: AGC, CAMK, CK1, 
CMGC, dsPK, EBFK, LEMK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 

1 5 presumably define new protein kinase groups. 

Additional characteristics may be found, inter alia, in the tables, namely Table 1 , 
Table 2, Table 3 and Table 4, shown below. 

II. Nucleic Acid Probes, Methods, and Kits for Detection of Pr otein Kinases. 

20 A nucleic acid probe of the present invention may be used to probe an appropriate 

chromosomal or cDNA library by usual hybridization methods to obtain other nucleic acid 
molecules of the present invention. A chromosomal DNA or cDNA library may be 
prepared from appropriate cells according to recognized methods in the art (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, Cold Spring Harbor Laboratory, 

25 Sambrook, Fritsch, & Maniatis, eds., 1 989). 

In the alternative, chemical synthesis can be carried out in order to obtain nucleic 
acid probes having nucleotide sequences that correspond to N-terminal, kinase or C- 
terminal portions, for example, of the amino acid sequence of the polypeptide of interest. 
The synthesized nucleic acid probes may be used as primers in a polymerase chain 

30 reaction (PCR) carried out in accordance with recognized PCR techniques, essentially 
according to PCR Protocols, "A Guide to Methods and Applications", Academic Press, 
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Michael, et ai, eds., 1990, utilizing the appropriate chromosomal or cDNA library to 
obtain the fragment of the present invention. 

One skilled in the art can readily design such probes based on the sequence 
disclosed herein using methods of computer alignment and sequence analysis known in 
5 the art ("Molecular Cloning: A Laboratory Manual", 1 989, supra). The hybridization 

probes of the present invention can be labeled by standard labeling techniques such as 
with a radiolabel, enzyme label, fluorescent label, biotin-avidin label, chemiluminescence, 
and the like. After hybridization, the probes may be visualized using known methods. 
The nucleic acid probes of the present invention include RNA, as well as DNA 

10 probes, such probes being generated using techniques known in the art. The nucleic acid 

probe may be immobilized on a solid support. Examples of such solid supports include, 
but are not limited to, plastics such as polycarbonate, complex carbohydrates such as 
agarose and sepharose, and acrylic resins, such as polyacrylamide and latex beads. 
Techniques for coupling nucleic acid probes to such solid supports are well known in the 

15 art. 

The test samples suitable for nucleic acid probing methods of the present invention 
include, for example, cells or nucleic acid extracts of cells, or biological fluids. The 
samples used in the above-described methods will vary based on the assay format, the 
detection method and the nature of the tissues, cells or extracts to be assayed. Methods for 

20 preparing nucleic acid extracts of cells are well known in the art and can be readily 

adapted in order to obtain a sample that is compatible with the method utilized. 

One method of detecting the presence of nucleic acids of the invention in a sample 
comprises (a) contacting said sample with the above-described nucleic acid probe under 
conditions such that hybridization occurs, and (b) detecting the presence of said probe 

25 bound to said nucleic acid molecule. One skilled in the art would select the nucleic acid 

probe according to techniques known in the art as described above. Samples to be tested 
include but should not be limited to RNA samples of human tissue. 

A kit for detecting the presence of nucleic acids of the invention in a sample 
comprises at least one container means having disposed therein the above-described 

30 nucleic acid probe. The kit may further comprise other containers comprising one or more 
of the following: wash reagents and reagents capable of detecting the presence of bound 
nucleic acid probe. Examples of detection reagents include, but are not limited to 
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radiolabeled probes, enzymatic labeled probes (horseradish peroxidase, alkaline 
phosphatase), and affinity labeled probes (biotin, avidin, or steptavidin). 

In detail, a compartmentalized kit includes any kit in which reagents are contained 
in separate containers. Such containers include small glass containers, plastic containers 
5 or strips of plastic or paper. Such containers allow the efficient transfer of reagents from 

one compartment to another compartment such that the samples and reagents are not 
cross-contaminated and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the probe or 

10 primers used in the assay, containers which contain wash reagents (such as phosphate 

buffered saline, Tris-buffers, and the like), and containers which contain the reagents used 
to detect the hybridized probe, bound antibody, amplified product, or the like. One skilled 
in the art will readily recognize that the nucleic acid probes described in the present 
invention can readily be incorporated into one of the established kit formats that are well 

15 known in the art. 

m. DNA Constructs Comprising a Protein Kinase Nucleic Acid Molecule and Cells 
Containing These Constructs. 

The present invention also relates to a recombinant DNA molecule comprising, 5' 
20 to 3', a promoter effective to initiate transcription in a host cell and the above-described 

nucleic acid molecules. In addition, the present invention relates to a recombinant DNA 
molecule comprising a vector and an above-described nucleic acid molecule. The present 
invention also relates to a nucleic acid molecule comprising a transcriptional region 
functional in a cell, a sequence complementary to an RNA sequence encoding an amino 
25 acid sequence corresponding to the above-described polypeptide, and a transcriptional 

termination region functional in said cell. The above-described molecules may be isolated 
and/or purified DNA molecules. 

The present invention also relates to a cell or organism that contains an above- 
described nucleic acid molecule and thereby is capable of expressing a polypeptide. The 
30 polypeptide may be purified from cells that have been altered to express the polypeptide. 
A cell is said to be "altered to express a desired polypeptide" when the cell, through 
genetic manipulation, is made to produce a protein which it normally does not produce or 
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which the cell normally produces at lower levels. One skilled in the art can readily adapt 
procedures for introducing and expressing either genomic, cDNA, or synthetic sequences 
into either eukaryotic or prokaryotic cells. 

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a 
polypeptide if it contains nucleotide sequences which contain transcriptional and 
translational regulatory information and such sequences are "operably linked" to 
nucleotide sequences which encode the polypeptide. An operable linkage is a linkage in 
which the regulatory DNA sequences and the DNA sequence sought to be expressed are 
connected in such a way as to permit gene sequence expression. The precise nature of the 
regulatory regions needed for gene sequence expression may vary from organism to 
organism, but shall in general include a promoter region which, in prokaryotes, contains 
both the promoter (which directs the initiation of RNA transcription) as well as the DNA 
sequences which, when transcribed into RNA, will signal synthesis initiation. Such 
regions will normally include those 5'-non-coding sequences involved with initiation of 
transcription and translation, such as the TATA box, capping sequence, CAAT sequence, 
and the like. 

If desired, the non-coding region 3 5 to the sequence encoding a kinase of the 
invention may be obtained by the above-described methods. This region may be retained 
for its transcriptional termination regulatory sequences, such as termination and 
polyadenylation. Thus, by retaining the 3' -region naturally contiguous to the DNA 
sequence encoding a kinase of the invention, the transcriptional termination signals may 
be provided. Where the transcriptional termination signals are not satisfactorily functional 
in the expression host cell, then a 3' region functional in the host cell may be substituted. 

Two DNA sequences (such as a promoter region sequence and a sequence 
encoding a kinase of the invention) are said to be operably linked if the nature of the 
linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region sequence to direct the 
transcription of a gene sequence encoding a kinase of the invention, or (3) interfere with 
the ability of the gene sequence of a kinase of the invention to be transcribed by the 
promoter region sequence. Thus, a promoter region would be operably linked to a DNA 
sequence if the promoter were capable of effecting transcription of that DNA sequence. 
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Thus, to express a gene encoding a kinase of the invention, transcriptional and 
translational signals recognized by an appropriate host are necessary. 

The present invention encompasses the expression of a gene encoding a kinase of 
the invention (or a functional derivative thereof) in either prokaryotic or eukaryotic cells. 
5 Prokaryotic hosts are, generally, very efficient and convenient for the production of 

recombinant proteins and are, therefore, one type of preferred expression system for 
kinases of the invention. Prokaryotes most frequently are represented by various strains of 
E. coli. However, other microbial strains may also be used, including other bacterial 
strains. 

10 In prokaryotic systems, plasmid vectors that contain replication sites and control 

sequences derived from a species compatible with the host may be used. Examples of 
suitable plasmid vectors may include pBR322, pUCl 18, pUCl 19 and the like; suitable 
phage or bacteriophage vectors may include ygtlO, ygtl 1 and the like; and suitable virus 
vectors may include pMAM-neo, pKRC and the like. Preferably, the selected vector of the 

1 5 present invention has the capacity to replicate in the selected host cell. 

Recognized prokaryotic hosts include bacteria such as E. coli, Bacillus, 
Streptomyces, Pseudomonas, Salmonella, Serratia, and the like. However, under such 
conditions, the polypeptide will not be glycosylated. The prokaryotic host must be 
compatible with the replicon and control sequences in the expression plasmid. 

20 To express a kinase of the invention (or a functional derivative thereof) in a 

prokaryotic cell, it is necessary to operably link the sequence encoding the kinase of the 
invention to a functional prokaryotic promoter. Such promoters may be either constitutive 
or, more preferably, regulatable (i.e., inducible or derepressible). Examples of constitutive 
promoters include the int promoter of bacteriophage X, the bla promoter of the P- 

25 lactamase gene sequence of pBR322, and the cat promoter of the chloramphenicol acetyl 

transferase gene sequence of pPR325, and the like. Examples of inducible prokaryotic 
promoters include the major right and left promoters of bacteriophage X (P L and P R ), the 
trp, recA. AacZ, AacI, and gal promoters of E. coli, the a-amylase (Ulmanen et al. 9 J. 
Bacteriol. 162:176-182, 1985) and the c-28-specific promoters of B. subtilis (Gilman et 

30 ai, Gene Sequence 32:1 1-20, 1984), the promoters of the bacteriophages of Bacillus 

(Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, Inc., NY, 1 982), and 
Streptomyces promoters (Ward et al, Mol. Gen. Genet. 203:468-478, 1986). Prokaryotic 
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promoters are reviewed by Glick (Lnd. Microbiol. 1 :277-282, 1987), Cenatiempo 
(Biochimie 68:505-516, 1986), and Gottesman (Ann. Rev. Genet. 18:415-442, 1984). 

Proper expression in a prokaryotic cell also requires the presence of a ribosome- 
binding site upstream of the gene sequence-encoding sequence. Such ribosome-binding 

5 sites are disclosed, for example, by Gold et al (Ann. Rev. Microbiol. 35:365-404, 1981). 

The selection of control sequences, expression vectors, transfonnation methods, and the 
like, are dependent on the type of host cell used to express the gene. As used herein, 
"cell", "cell line", and "cell culture" may be used interchangeably and all such 
designations include progeny. Thus, the words "transformants" or transformed cells" 

10 include the primary subject cell and cultures derived therefrom, without regard to the 

number of transfers. It is also understood that all progeny may not be precisely identical 
in DNA content, due to deliberate or inadvertent mutations. However, as defined, mutant 
progeny have the same functionality as that of the originally transformed cell. 

Host cells which may be used in the expression systems of the present invention 

15 are not strictly limited, provided that they are suitable for use in the expression of the 

kinase polypeptide of interest. Suitable hosts may often include eukaryotic cells. 
Preferred eukaryotic hosts include, for example, yeast, fungi, insect cells, mammalian cells 
either in vivo, or in tissue culture. Mammalian cells which may be useful as hosts include 
HeLa cells, cells of fibroblast origin such as VERO or CHO-K1, or cells of lymphoid 

20 origin and their derivatives. Preferred mammalian host cells include SP2/0 and J558L, as 

well as neuroblastoma cell lines such as EMR 332, which may provide better capacities for 
correct post-translational processing. 

In addition, plant cells are also available as hosts, and control sequences 
compatible with plant cells are available, such as the cauliflower mosaic virus 35S and 

25 19S, and nopaline synthase promoter and polyadenylation signal sequences. Another 

preferred host is an insect cell, for example the Drosophila larvae. Using insect cells as 
hosts, the Drosophila alcohol dehydrogenase promoter can be used (Rubin, Science 
240:1453-1459, 1988). Alternatively, baculovirus vectors can be engineered to express 
large amounts of kinases of the invention in insect cells (Jasny, Science 238:1653, 1987; 

30 Miller et al., In: Genetic Engineering, Vol. 8, Plenum, Setlow et al, eds., pp. 277-297, 

1986). 
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Any of a series of yeast expression systems can be utilized which incorporate 
promoter and termination elements from the actively expressed sequences coding for 
glycolytic enzymes that are produced in large quantities when yeast are grown in mediums 
rich in glucose. Known glycolytic gene sequences can also provide very efficient 
transcriptional control signals. Yeast provides substantial advantages in that it can also 
carry out post-translational modifications. A number of recombinant DNA strategies exist 
utilizing strong promoter sequences and high copy number plasmids which can be utilized 
for production of the desired proteins in yeast. Yeast recognizes leader sequences on 
cloned mammalian genes and secretes peptides bearing leader sequences (i.e., pre- 
peptides). Several possible vector systems are available for the expression of kinases of 
the invention in a mammalian host. 

A wide variety of transcriptional and translation^ regulatory sequences may be 
employed, depending upon the nature of the host. The transcriptional and translational 
regulatory signals may be derived from viral sources, such as adenovirus, bovine 
papilloma virus, cytomegalovirus, simian virus, or the like, where the regulatory signals 
are associated with a particular gene sequence which has a high level of expression. 
Alternatively, promoters from mammalian expression products, such as actin, collagen, 
myosin, and the like, may be employed. Transcriptional initiation regulatory signals may 
be selected which allow for repression or activation, so that expression of the gene 
sequences can be modulated. Of interest are regulatory signals which are temperature- 
sensitive so that by varying the temperature, expression can be repressed or initiated, or 
are subject to chemical (such as metabolite) regulation. 

Expression of kinases of the invention in eukaryotic hosts requires the use of 
eukaryotic regulatory regions. Such regions will, in general, include a promoter region 
sufficient to direct the initiation of RNA synthesis. Preferred eukaryotic promoters 
include, for example, the promoter of the mouse metallothionein 1 gene sequence (Hamer 
et ai, J. Mol. Appl. Gen. 1 :273-288, 1982); the TK promoter of Herpes virus (McKnight, 
Cell 31 :355-365, 1 982); the SV40 early promoter (Benoist et ai, Nature (London) 
290:304-31, 1981); and the yeast gal4 gene sequence promoter (Johnston et ai, Proc. Natl. 
Acad. Sci. (USA) 79:6971-6975, 1982; Silver et ai, Proc. Natl. Acad. Sci. (USA) 
81:5951-5955, 1984). 
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Translation of eukaryotic mRNA is initialed at the codon that encodes the first 
methionine. For this reason, it is preferable to ensure that the linkage between a 
eukaryotic promoter and a DNA sequence which encodes a kinase of the invention (or a 
functional derivative thereof) does not contain any intervening codons which are capable 
of encoding a methionine (i.e., AUG). The presence of such codons results either in the 
formation of a fusion protein (if the AUG codon is in the same reading frame as the kinase 
of the invention coding sequence) or a frame-shift mutation (if the AUG codon is not in 
the same reading frame as the kinase of the invention coding sequence). 

A . nucleic acid molecule encoding a kinase of the invention and an operably linked 
promoter may be introduced into a recipient pTokaryotic or eukaryotic cell either as a 
nonreplicating DNA or RNA molecule, which may either be a linear molecule or, more 
preferably, a closed covalent circular molecule. Since such molecules are incapable of 
autonomous replication, the expression of the gene may occut through the transient 
expression of the introduced sequence. Alternatively, permanent expression may occur 
through the integration of the introduced DNA sequence into the host chromosome. 

A vector may be employed which is capable of integrating the desired gene 
sequences into the host cell chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can be selected by also introducing one or more 
markers which allow for selection of host cells which contain the expression vector. The 
marker may provide for prototrophy to an auxotrophic host, biocide resistance, e.g., 
antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene 
sequence can either be directly linked to the DNA gene sequences to be expressed, or 
introduced into the same cell by co-transfection. Additional elements may also be needed 
for optimal synthesis of mRNA. These elements may include splice signals, as well as 
transcription promoters, enhancers, and termination signals. cDNA expression vectors 
incorporating such elements include those described by Okayama (Mol. Cell. Biol. 3:280-, 
1983). 

The introduced nucleic acid molecule can be incorporated into a plasmid or viral 
vector capable of autonomous replication in the recipient host. Any of a wide variety of 
vectors may be employed for this purpose. Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient cells that contain the vector 
may be recognized and selected from those recipient cells which do not contain the vector; 
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the number of copies of the vector which are desired in a particular host; and whether it is 
desirable to be able to "shuttle" the vector between host cells of different species. 

Preferred prokaryotic vectors include plasmids such as those capable of replication 
inf. coli (such as, for example, pBR322, ColEl, pSClOl, pACYC 184, tcVX; "Molecular 
5 Cloning: A Laboratory Manual", 1989, supra). Bacillus plasmids include pC194, pC221 , 

pT127, and the like (Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, 
NY, pp. 307-329, 1982). Suitable Streptomyces plasmids include pi J 1 01 (Kendall et aL, 
J. Bacteriol. 169:4177-4183, 1987), and streptomyces bacteriophages such as <}>C31 
(Chater or a/.,. In: Sixth Intemational Symposium on Actinomycetales Biology, Akademiai 
1 0 Kaido, Budapest, Hungary, pp. 45-54, 1 986). Pseudomonas plasmids are reviewed by 
John et al (Rev. Infect. Dis. 8:693-704, 1986), and Izaki (Jpn. J. Bacteriol. 33:729-742, 
1978). 

Preferred eukaryotic plasmids include, for example, BPV, vaccinia, SV40, 2- 
micron circle, and the like, or their derivatives. Such plasmids are well known in the art 

15 (Botstein et aL, Miami Wntr. Symp. 19:265-274, 1982; Broach, In: "The Molecular 

Biology of the Yeast Saccharomyces: Life Cycle and Inheritance", Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, p. 445-470, 1981; Broach, Cell 28:203-204, 1982; 
Bollon et a/., J. Clin. Hematol. Oncol. 10:39-48, 1980; Maniatis, In: Cell Biology: A 
Comprehensive Treatise, Vol. 3, Gene Sequence Expression, Academic Press, NY, pp. 

20 563-608, 1980). 

Once the vector or nucleic acid molecule containing the construct(s) has been 
prepared for expression, the DNA construct(s) may be introduced into an appropriate host 
cell by any of a variety of suitable means, i.e., transformation, transfection, conjugation, 
protoplast fusion, electroporation, particle gun technology, calcium phosphate- 

25 precipitation, direct microinjection, and the like. After the introduction of the vector, 

recipient cells are grown in a selective medium, which selects for the growth of vector- 
containing cells. Expression of the cloned gene(s) results in the production of a kinase of 
the invention, or fragments thereof. This can take place in the transformed cells as such, 
or following the induction of these cells to differentiate (for example, by administration of 

30 bromodeoxyuracil to neuroblastoma cells or the like). A variety of incubation conditions 
can be used to form the peptide of the present invention. The most preferred conditions 
are those which mimic physiological conditions. 
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IV. The Proteins of the Invention 

A variety of methodologies known in the art can be utilized to obtain the 
polypeptides of the present invention. The polypeptides may be purified from tissues or 
cells that naturally produce the polypeptides. Alternatively, the above-described isolated 
5 nucleic acid fragments could be used to express the kinases of the invention in any 

organism. The samples of the present invention include cells, protein extracts or 
membrane extracts of cells, or biological fluids. The samples will vary based on the assay 
format, the detection method, and the nature of the tissues, cells or extracts used as the 
sample. 

10 Any eukaryotic organism can be used as a source for the polypeptides of the 

invention, as long as the source organism naturally contains such polypeptides. As used 
herein, "source organism" refers to the original organism from which the amino acid 
sequence of the subunit is derived, regardless of the organism the subunit is expressed in 
and ultimately isolated from. 

1 5 One skilled in the art can readily follow known methods for isolating proteins in 

order to obtain the polypeptides free of natural contaminants. These include, but are not 
limited to: size-exclusion chromatography, HPLC, ion-exchange chromatography, and 
immuno-affinity chromatography. 

Further, the polypeptides of the invention include the full-length polypeptides that 

20 can be identified from the full-length or partial sequences encoded by SEQ ID NO: 122, 

SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, 
SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, 
SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, 
SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, 

25 SEQ ED NO:143, SEQ ID NO:144, SEQ ID NO:145 5 SEQ ID NO:146, SEQ ID NO:147, 

SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, 
SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, 
SEQ ID NO:l 58, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, 
SEQ ED NO:163, SEQ ID NO:164, SEQ ID NO:165. SEQ ID NO:166, SEQ ID NO:167, 

30 SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, 
SEQ ED NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, 
SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ED NO : 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82 , 
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SEQ ID NO: 1 83, SEQ ED NO: 1 84, SEQ ED NO:l 85, SEQ ID NO:l 86, SEQ ID NO:l 87, 
SEQ ED NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ED NO: 1 9 1 , SEQ ED NO: 1 99, 
SEQ ED NOi 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, SEQ ED NO: 197, 
SEQ ED NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ ED NO:202, 
5 SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID NO:207, 

SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ ED NO:21 1, SEQ ED NO:212, 
SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ED NO:216, SEQ ED NO:217, 
SEQ ED NO:218, SEQ ED NO:219, SEQ ID NO:220, SEQ ED NO:221, SEQ ED NO:222, 
SEQ ED NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ID NO:227, 

10 SEQ ED NO:228, SEQ ED NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 

SEQ ED NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, 
SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and SEQ ID 
NO:242 (Figure 1). In addition, the polypeptides of the invention include the domains of 
these polypeptides, including, but not limited to, the N-terminal, kinase/catalytic, and C- 

1 5 terminal domains. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1 . The protein kinases fall into 1 0 known groups: AGC, CAMK, CKI, 
CMGC, dsPK, EEFK, LIMK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 

20 presumably define new protein kinase groups. 

Additional characteristics are shown in, inter alia, the tables, namely Table 1, 
Table 2, Table 3 and Table 4, provided below. 

V. Antibodies, Hvbridomas, Methods of Use and Kits for Detection of Protein 

25 Kinases 

The present invention relates to an antibody having binding affinity to a kinase of 
the invention. The polypeptide may have an amino acid sequence selected from the group 
consisting of those set forth in SEQ ID NO: 1 22, SEQ ED NO: 1 23, SEQ ID NO: 1 24, SEQ 
ED NO: 125, SEQ ED NO: 126, SEQ ED NO: 127, SEQ ED NO: 128, SEQ ID NO: 129, SEQ 

30 ED NO: 130, SEQ ED NO:131, SEQ ED NO: 132, SEQ ED NO: 133, SEQ ED NO: 134, SEQ 
ED NO: 1 35, SEQ ED NO: 1 36, SEQ ED NO: 1 37, SEQ ED NO: 1 38, SEQ ED NO: 1 39, SEQ 
ED NO: 1 40, SEQ ED NO: 1 4 1 , SEQ ED NO: 1 42, SEQ ED NO: 1 43 , SEQ ED NO: 1 44, SEQ 
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ED NO: 145, SEQ ID NO: 146, SEQ ED NO: 147, SEQ ED NO: 148, SEQ ID NO: 149, SEQ 
ID NO: 1 50, SEQ ED NO: 1 5 1 , SEQ ED NO: 1 52, SEQ ID NO: 1 53, SEQ ED NO: 1 54, SEQ 
ID NO:155, SEQ ED N0.156, SEQ ED NO: 157, SEQ ED NO: 158, SEQ ED NO:159, SEQ 
ED NO:160, SEQ ED NO:161, SEQ ID NO:162, SEQ ED NO:163, SEQ ID NO:164, SEQ 
5 ID NO: 165. SEQ ED NO: 166, SEQ ED NO: 167, SEQ ED NO: 168, SEQ ED NO: 169, SEQ 
ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ED NO: 1 73, SEQ ID NO: 1 74, SEQ 
ID NO: 175, SEQ ID NO: 176, SEQ ED NO: 177, SEQ ED NO: 178, SEQ ID NO: 179, SEQ 
ID NO: 1 80, SEQ ED NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ID NO: 1 83, SEQ ED NO: 1 84, SEQ 
ID NO: 1 85, SEQ ID NO: 1 86, SEQ ED NO: 1 87, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ 

10 ED NO:190, SEQ ID NO:191, SEQ ED NO:199, SEQ ID NO:193, SEQ ED NO:194, SEQ 
ID NO: 195, SEQ ED NO: 196, SEQ ED NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ 
ID NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ 
ID NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ ED NO:21 1, SEQ ED NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ 

15 ED NO:215, SEQ ED NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ID NO:219, SEQ 

ID NO:220, SEQ ED NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ 
ID NO:225, SEQ ID NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ 
ID NO:230, SEQ ED NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ 
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ 

20 ID NO:240, SEQ ID NO:241, and SEQ ED NO:242, or a functional derivative thereof, or 
at least 9 contiguous amino acids thereof (preferably, at least 20, 30, 35, or 40 or more 
contiguous amino acids thereof). Alternatively, the antibody may bind to a part of the 
polypeptide not provided in the sequences above, but that is present in the full-length 
sequence of the polypeptide and that is easily obtained using methods standard in the art. 

25 Further, the antibody may bind specifically to particular domains of one or more of the 

kinases of the invention, including, but not. limited to, the N-terminal, kinase/catalytic, or 
C-terminal domains. 

The present invention also relates to an antibody having specific binding affinity to 
a kinase or kinase domain of the invention. Such an antibody may be isolated by 
30 comparing its binding affinity to a kinase of the invention with its binding affinity to other 
polypeptides. Those that bind selectively to a kinase of the invention would be chosen for 
use in methods requiring a distinction between a kinase of the invention and other 



WO 00/73469 



PCT/US00/14842 



90 

polypeptides. Such methods could include, but should not be limited to, the analysis of 
altered kinase expression in tissue containing other polypeptides. 

The kinases of the present invention can be used in a variety of procedures and 
methods, such as for the generation of antibodies, for use in identifying phannaceutical 
5 compositions, and for studying DNA/protein interaction. 

The kinases of the present invention can be used to produce antibodies or 
hybridomas. One skilled in the art will recognize that if an antibody is desired, such a 
peptide could be generated as described herein and used as an immunogen. The antibodies 
of the present invention include monoclonal and polyclonal antibodies, as well fragments 

10 of these antibodies, and humanized forms. Humanized forms of the antibodies of the 

present invention may be generated using one of the procedures known in the art such as 
chimerization or CDR grafting. 

The present invention also relates to a hybridoma that produces the above- 
described monoclonal antibody, or binding fragment thereof. A hybridoma is an 

15 immortalized cell line that is capable of secreting a specific monoclonal antibody. 

In general, techniques for preparing monoclonal antibodies and hybridomas are 
well known in the art (Campbell, "Monoclonal Antibody Technology: Laboratory 
Techniques in Biochemistry and Molecular Biology," Elsevier Science Publishers, 
Amsterdam, The Netherlands, 1984; St. Groth et al, J. Immunol. Methods 35:1-21, 1980). 

20 Any animal (mouse, rabbit, and the like) which is known to produce antibodies can be 

immunized with the selected polypeptide. Methods for immunization are well known in 
the art. Such methods include subcutaneous or intraperitoneal injection of the 
polypeptide. One skilled in the art will recognize that the amount of polypeptide used for 
immunization will vary based on the animal that is immunized, the antigenicity of the 

25 polypeptide and the site of injection. 

The polypeptide may be modified or administered in an adjuvant in order to 
increase the peptide antigenicity. Methods of increasing the antigenicity of a polypeptide 
are well known in the art. Such procedures include coupling the antigen with a 
heterologous protein (such as globulin or p-galactosidase) or through the inclusion of an 

30 adjuvant during immunization. 
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For monoclonal antibodies, spleen cells from the immunized animals are removed, 
fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and allowed to become 
monoclonal antibody producing hybridoma cells. Any one of a number of methods well 
known in the art can be used to identify the hybridoma cell that produces an antibody with 
5 the desired characteristics. These include screening the hybridomas with an EL1SA assay, 
western blot analysis, or radioimmunoassay (Lutz et aL, Exp. Cell Res. 175:109-124, 
1988). Hybridomas secreting the desired antibodies are cloned and the class and subclass 
are determined using procedures known in the art (Campbell, "Monoclonal Antibody 
Technology: Laboratory Techniques in Biochemistry and Molecular Biology", supra, 
10 1984). 

For polyclonal antibodies, antibody-containing antisera is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. The above-described antibodies 
may be detectably labeled. Antibodies can be detectably labeled through the use of 

15 radioisotopes, affinity labels (such as biotin, avidin, and the like), enzymatic labels (such 

as horse radish peroxidase, alkaline phosphatase, and the like) fluorescent labels (such as 
FITC or rhodamine, and the like), paramagnetic atoms, and the like. Procedures for 
accomplishing such labeling are well-known in the art, for example, see Stemberger et ai, 
J.Histochem. Cytochem. 18:315, 1970; Bayer et al % Meth. Enzym. 62:308-, 1979; Engval 

20 et ai, Immunol. 109:129-, 1972; Goding, J. Immunol. Meth. 13:215-, 1976. The labeled 

antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to 
identify cells or tissues that express a specific peptide. 

The above-described antibodies may also be immobilized on a solid support. 
Examples of such solid supports include plastics such as polycarbonate, complex 

25 carbohydrates such as agarose and sepharose, acrylic resins and such as polyacrylamide 

and latex beads. Techniques for coupling antibodies to such solid supports are well known 
in the art (Weir et ai 9 "Handbook of Experimental Immunology' 5 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 10, 1986; Jacoby et al., Meth. Enzym. 
34, Academic Press, N.Y., 1974). The immobilized antibodies of the present invention 

30 can be used for in vitro, in vivo, and in situ assays as well as in immunochromotography. 
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Furthermore, one skilled in the art can readily adapt currently available procedures, 
as well as the techniques, methods and kits disclosed herein with regard to antibodies, to 
generate peptides capable of binding to a specific peptide sequence in order to generate 
rationally designed antipeptide peptides (Hurby et ai, "Application of Synthetic Peptides: 
5 Antisense Peptides", In Synthetic Peptides, A User's Guide, W.H. Freeman, NY, pp. 289- 

307, 1992; Kaspczak et ai, Biochemistry 28:9230-9238, 1989). 

Anti-peptide peptides can be generated by replacing the basic amino acid residues 
found in the peptide sequences of the kinases of the invention with acidic residues, while 
maintaining hydrophobic and uncharged polar groups. For example, lysine, arginine, 

10 and/or histidine residues are replaced with aspartic acid or glutamic acid and glutamic acid 

residues are replaced by lysine, arginine or histidine. 

The present invention also encompasses a method of detecting a kinase 
polypeptide in a sample, comprising: (a) contacting the sample with an above-described 
antibody, under conditions such that immunocomplexes form, and (b) detecting the 

1 5 presence of said antibody bound to the polypeptide. In detail, the methods comprise 

incubating a test sample with one or more of the antibodies of the present invention and 
assaying whether the antibody binds to the test sample. Altered levels of a kinase of the 
invention in a sample as compared to normal levels may indicate disease. 

Conditions for incubating an antibody with a test sample vary. Incubation 

20 conditions depend on the format employed in the assay, the detection methods employed, 

and the type and nature of the antibody used in the assay. One skilled in the art will 
recognize that any one of the commonly available immunological assay formats (such as 
radioimmunoassays, enzyme-linked immunosorbent assays, diffusion based Ouchterlony, 
or rocket immunofluorescent assays) can readily be adapted to employ the antibodies of 

25 the present invention. Examples of such assays can be found in Chard ("An Introduction 

to Radioimmunoassay and Related Techniques" Elsevier Science Publishers, Amsterdam, 
The Netherlands, 1986), Bullock et ah ("Techniques in Immunocytochemistry," Academic 
Press, Orlando, FL Vol. 1, 1982; Vol. 2, 1983; Vol. 3, 1985), Tijssen ("Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 

30 Biology," Elsevier Science Publishers, Amsterdam, The Netherlands, 1 985). 
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The immunological assay test samples of the present invention include cells, 
protein or membrane extracts of cells, or biological fluids such as blood, serum, plasma, or 
urine. The test samples used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample 
5 to be assayed. Methods for preparing protein extracts or membrane extracts of cells are 

well known in the art and can be readily be adapted in order to obtain a sample which is 
testable with the system utilized. 

A kit contains all the necessary reagents to carry out the previously described 
methods of detection. The kit may comprise: (i) a first container means containing an 
10 above-described antibody, and (ii) second container means containing a conjugate 

comprising a binding partner of the antibody and a label. In another preferred 
embodiment, the kit further comprises one or more other containers comprising one or 
more of the following: wash reagents and reagents capable of detecting the presence of 
bound antibodies. 

15 Examples of detection reagents include, but are not limited to, labeled secondary 

antibodies, or in the alternative, if the primary antibody is labeled, the chromophoric, 
enzymatic, or antibody binding reagents that are capable of reacting with the labeled 
antibody. The compartmentalized kit may be as described above for nucleic acid probe 
kits. One skilled in the art will readily recognize that the antibodies described in the 

20 present invention can readily be incorporated into one of the established kit formats that 

are well known in the art. 

VI. Isolation of Compounds That Interact With Protein Kinases 

The present invention also relates to a method of detecting a compound capable of 
binding to a protein kinase of the invention, comprising incubating the compound with a 
25 kinase of the invention and detecting the presence of the compound bound to the kinase. 

The compound may be present within a complex mixture, for example, serum, body fluid, 
or cell extracts. 

The present invention also relates to a method of detecting an agonist or antagonist 
of kinase activity or kinase binding partner activity comprising incubating cells that 
30 produce a kinase of the invention in the presence of a compound and detecting changes in 
the level of kinase activity or kinase binding partner activity. The compounds thus 
identified would produce a change in activity indicative of the presence of the compound. 
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The compound may be present within a complex mixture, for example, serum, body fluid, 
or cell extracts. Once the compound is identified it can be isolated using techniques well 
known in the art. 

The present invention also encompasses a method of agonizing (stimulating) or 
5 antagonizing kinase associated activity in a mammal comprising administering to said 

mammal an agonist or antagonist to a kinase of the invention in an amount sufficient to 
effect said agonism or antagonism. A method of treating diseases in a mammal with an 
agonist or antagonist of kinase activity comprising administering the agonist or antagonist 
to a mammal in an amount sufficient to agonize or antagonize kinase associated functions 

10 is also encompassed in the present application. 

In an effort to discover novel treatments for diseases, biomedical researchers and 
chemists have designed, synthesized, and tested molecules that inhibit the function of 
protein kinases. Some small organic molecules form a class of compounds that modulate 
the function of protein kinases. Examples of molecules that have been reported to inhibit 

15 the function of protein kinases include, but are not limited to, bis monocyclic, bicyclic or 

heterocyclic aryl compounds (PCT WO 92/20642, published November 26, 1992 by 
Maguire et al), vinylene-azaindole derivatives (PCT WO 94/14808, published July 7, 
1994 by Ballinari et aL), l-cyclopropyl-4-pyridyl-quinolones (U.S. Patent No. 5,330,992), 
styryl compounds (U.S. Patent No. 5,217,999), styryl-substituted pyridyl compounds (U.S. 

20 Patent No. 5,302,606), certain quinazoline derivatives (EP Application No. 0 566 266 Al), 

seleoindoles and selenides (PCT WO 94/03427, published February 17, 1994 by Denny et 
al), tricyclic polyhydroxylic compounds (PCT WO 92/21660, published December 10, 
1992 by Dow), and benzylphosphonic acid compounds (PCT WO 91/1 5495, published 
October 17, 1991 by Dow et al). 

25 Compounds that can traverse cell membranes and are resistant to acid hydrolysis 

are potentially advantageous as therapeutics as they can become highly bioavailable after 
being administered orally to patients. However, many of these protein kinase inhibitors 
only weakly inhibit the function of protein kinases. In addition, many inhibit a variety of 
protein kinases and will cause multiple side-effects as therapeutics for diseases. 

30 Some indolinone compounds, however, form classes of acid resistant and 

membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by 
Ballinari et al) describes hydrosoluble indolinone compounds that harbor tetralin, 
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naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic 
substituents are in turn substituted with polar moieties including hydroxylated alkyl, 
phosphate, and ether moieties. U.S. Patent Application Serial Nos. 08/702,232, filed 
August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and 
Methods for the Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 221/187) 
and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline Compounds for the 
Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 223/298) and 
International Patent Publication WO 96/22976, published August 1, 1996 by Ballinari et 
al, all of which are incorporated herein by reference in their entirety, including any 
drawings, describe indolinone chemical libraries of indolinone compounds harboring other 
bicyclic moieties as well as monocyclic moieties fused to the oxindole ring. Applications 
08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial Libraries and 
Related Products and Methods for the Treatment of Disease" by Tang et al (Lyon & Lyon 
Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline 
Compounds for the Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 
223/298), and WO 96/22976, published August 1, 1996 by Ballinari et al. teach methods 
of indolinone synthesis, methods of testing the biological activity of indolinone 
compounds in cells, and inhibition patterns of indolinone derivatives. 

Other examples of substances capable of modulating kinase activity include, but 
are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The 
quinazolines, tyrphostins, quinolines, and quinoxolines referred to above include well 
known compounds such as those described in the literature. For example, representative 
publications describing quinazolines include Barker et al, EPO Publication No. 0 520 722 
Al; Jones et al, U.S. Patent No.4,447,608; Kabbe et al, U.S. Patent No. 4,757,072; Kaul 
and Vougioukas, U.S. Patent No. 5, 316,553; Kreighbaum and Comer, U.S. Patent No. 
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 Al ; Barker et al, 
Proc. of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research 
3:293-304 (1979); Bertino, J.R., Cancer Research 9(2 part l):293-304 (1979); Curtin et 
al, Br. J. Cancer 53:361-368 (1986); Fernandes et al, Cancer Research 43:1 1 17-1 123 
(1983); Ferris et al J. Ore. Chem. 44(2): 173- 178; Fry et al, Science 265:1093-1095 
(1994); Jackman el al, Cancer Research 51:5579-5586 (1981); Jones et al J. Med. Chem. 
29(6):1 1 14-1 1 1 8; Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al, L 



WO 00/73469 



96 



PCT/US00/14842 



Ore. Chem. 54:3511-3518 (1989); Ley and Seng, Synthesis 1975:415-522 (1975); 
Maxwell et al, Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al, Cancer 
Research 45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 1 7(1 9): 1 489-1 596 
(1980); Reece et al, Cancer Research 47(1 1):2996-2999 (1977); Sculier et al, Cancer 
5 Immunol, and Immunother. 23:A65 (1986); Sikora et al, Cancer Letters 23:289-295 

(1984); Sikora et al, Analytical Biochem. 172:344-355 (1988); all of which are 
incorporated herein by reference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 

10 Quinolines are described in Dolle et al, J. Med. Chem. 37:2627-2629 (1994); 

MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et al, J. Med. Chem. 36:425-432 
(1993); and Burke et al BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which 
are incorporated by reference in their entirety, including any drawings. 

Tyrphostins are described in Allen et al, Clin. Exp. Immunol. 91 :1 41 -156 (1993); 

15 Anafi etnl Blond 82:12:3524-3529 (1993); Baker et al. J. Cell Sci. 102:543-555 (1992); 

Bilder et al, Amer. Physiol . Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al, 
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al, Experimental 
Cell Research 199:255-261 (1992); Done et al, J. Leukocyte Biology 53:53-60 (1993): 
Dong et al, J. Immunol. 151(5):271 7-2724 (1993); Gazit et al, J. Med. Chem. 32:2344- 

20 2352 (1989); Gazit et al, " J. Med. Chem. 36:3556-3564 (1993); Kaur et al, Anti-Cancer 
Drugs 5 :2 1 3-222 ( 1 994); Kaur et al. , King et al , Biochem. J. 275:413-41 8 (1991); Kuo et 
al, Cancer Letters 74:197-202 (1993); Levitzki, A., The FASEB J. 6:3275-3282 (1992); 
Lyallerfl/.. J.Biol. Chem. 264:14503-14509(1989); Peterson et al, The Prostate 22:335- 
345 (1993); Pillemer et al, Int. J. Cancer 50:80-85 (1992); Posner et al, Molecular 

25 Pharmacology 45:673-683 (1 993); Rendu et al, Biol. Pharmacology 44(5):88 1-888 
(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, J, 
Pharm. and Experimental Therapeutics 267(3): 119-1125 (1 993); Wolbring et al , J. Biol. 
Chem. 269(36):22470-22472 (1994); and Yoneda et al, Cancer Research 51 :4430-4435 
(1991); all of which are incorporated herein by reference in their entirety, including any 

30 drawings. 
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Other compounds that could be used as modulators include oxindolinones such as 
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996, 
incorporated herein by reference in its entirety, including any drawings. 
VII. Biological Significance. Applications and Clinical Relevance of Novel Protein 

Kinases 

For each protein kinase in this application, we provide a classification of the 
protein class and family to which it belongs, a summary of non-cataltyic protein motifs, a 
profile of its expression in several hundred tissue and cell sources, and a chromosomal 
location. This information can be used to suggest potential function, regulation or 
therapeutic utility for each of the proteins. 

The kinase classification and protein domains often reflect pathways, cellular roles, 
or mechanisms of up- or down-stream regulation. Also disease-relevant genes often occur 
in families of related genes. For example if one member of a kinase family functions as an 
oncogene, a tumor suppressor, or has been found to be disrupted in an immune, 
neurologic, cardiovascular, or metabolic disorder, frequently other family members may 
play a related role. 

The expression analysis organizes kinases into groups that are transcriptionally 
upregulated in tumors and those that are more restricted to specific tumor types such as 
melanoma or prostate. This analysis also identifies genes that are regulated in a cell cycle 
dependent manner, and are therefore likely to be involved in maintaining cell cycle 
checkpoints, entry, progression, or exit from mitosis, oversee DNA repair, or are involved 
in cell proliferation and genome stability. Expression data also can identify genes 
expressed in endothelial sources or other tissues that suggest a role in angiogenesis, 
thereby implicating them as targets for control of diseases that have an angiogenic 
component, such as cancer, endometriosis, retinopathy and macular degeneration, and 
various ischemic or vascular pathologies. A proteins' role in cell survival can also be 
suggested based on restricted expression in cells subjected to external stress such as 
oxidative damage, hypoxia, drugs such as cisplatinum, or irradiation. Metastases- 
associated genes can be implicated when expression is restricted to invading regions of a 
tumor, or is only seen in local or distant metastases compared to the primary tumor, or 
when a gene is upregulated during cell culture models of invasion, migration, or motility. 
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Chromosomal location can identify candidate targets for a tumor amplicon or a 
rumor-suppressor locus. Summaries of prevelant tumor amplicons are available in the 
literature, and can identify tumor types to experimentally be confirmed to contain 
amplified copies of a kinase gene which localizes to an adjacent region. 
5 Based on these criteria several kinases immediately stand out as being of potential 

therapeutic relevance. The protein kinases can be divided into the following disease- 
relevant categories (nucleotide Seq ID #s in parentheses): 

Tumor associated: Mok (SEQ ID NO:NO:57), EPK2, AA3 16804 (SEQ ID 
NO:l 1), AA435956 (SEQ ID NO:NO:48), AA278842 (SEQ ID NO:88), AA599286 (SEQ 
10 ID NO:89), AA826850 (SEQ ID NO:3), HR1 (SEQ ID NO:73), MLK4 AA232253 (SEQ 

ID NO:82), AA883975 SGK 235 (SEQ ID NO:95), AA31 1714 (SEQ ID NO: 101), 
MPSK1 (SEQ ID NO:l 10), R19609 (Seq ID1 1 1), AA383293 (SEQ ID NO:26). 

Prostate-specific: AA234451 (SEQ ID NO:47), TSK4 (SEQ ID NO:93), RIP4 
(SEQ ID NO:84), K1AA0965 (SEQ ID NO:8). 
15 Oncogenic or proliferation associated: KIAA0781 (SEQ ID NO:38), AA789239 

(SEQ ID NO:52), CCRK (SEQ ID NO:54), CLK4 (SEQ ID NO:55), H85389 (SEQ ID 
NO:97). 

Neuronal restricted: CAMKKB (SEQ ID NO:66) 

Hematopoietic expressed: PTK9L (SEQ ID NO:22), DRAK2 (SEQ ID NO:29), 
20 A1025291 (SEQ ID NO:94) 

Angiogenic or endothelial expressed: DRAK1 (SEQ ID NO:31), MAK-V (SEQ 
ID NO:40), TRAD (SEQ ID NO:44), MOK (SEQ ID NO:57), AA08847 (SEQ ID NO:78), 
HGP_66444466 (SEQ ID NO:79), RSK4 (SEQ ID NO: 16). 

Cell cycle regulated: AA454060 (SEQ ID NO:45), KIAA0999 (Mitotic - SEQ ID 
25 NO:32), AA57964 1 (Mitotic - SEQ ID NO:60), AA305 1 76 (Mitotic - SEQ ID NO:6), 
AA018361 (SI phase- SEQ ID NO: 100). 
Vm. Transgenic Animals. 

A variety of methods are available for the production of transgenic animals 
associated with this invention. DNA can be injected into the pronucleus of a fertilized egg 
30 before fusion of the male and female pronuclei, or injected into the nucleus of an 

embryonic cell (e.g., the nucleus of a two-cell embryo) following the initiation of cell 
division (Brinster el al, Proc. Nat. Acad. Sci. USA 82: 4438-4442, 1 985). Embryos can 
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be infected with viruses, especially retroviruses, modified to carry inorganic-ion receptor 
nucleotide sequences of the invention. 

Pluripotent stem cells derived from the inner cell mass of the embryo and 
stabilized in culture can be manipulated in culture to incorporate nucleotide sequences of 
5 the invention. A transgenic animal can be produced from such cells through implantation 

into a blastocyst that is implanted into a foster mother and allowed to come to term. 
Animals suitable for transgenic experiments can be obtained from standard commercial 
sources such as Charles River (Wilmington, MA), Taconic (Germantown, NY), Harlan 
Sprague Dawley (Indianapolis, IN), etc. 
10 The procedures for manipulation of the rodent embryo and for microinjection of 

DNA into the pronucleus of the zygote are well known to those of ordinary skill in the art 
(Hogan el al. 9 supra). Microinjection procedures for fish, amphibian eggs and birds are 
detailed in Houdebine and Chourrout (Experientia 47: 897-905, 1991). Other procedures 
for introduction of DNA into tissues of animals are described in U.S. Patent No., 
15 4,945,050 (Sanford el ai 9 July 30, 1990). 

By way of example only, to prepare a transgenic mouse, female mice are induced 
to superovulate. Females are placed with males, and the mated females are sacrificed by 
C0 2 asphyxiation or cervical dislocation and embryos are recovered from excised 
oviducts. Surrounding cumulus cells are removed. Pronuclear embryos are then washed 
20 and stored until the time of injection. Randomly cycling adult female mice are paired with 

vasectomized males. Recipient females are mated at the same time as donor females. 
Embryos then are transferred surgically. The procedure for generating transgenic rats is 
similar to that of mice (Hammer et al. t Cell 63:1099-1 112, 1990). 

Methods for the culturing of embryonic stem (ES) cells and the subsequent 
25 production of transgenic animals by the introduction of DNA into ES cells using methods 

such as electroporation, calcium phosphate/DNA precipitation and direct injection also are 
well known to those of ordinary skill in the art (Teratocarcinomas and Embryonic Stem 
Cells, A Practical Approach, E.J. Robertson, ed. 3 IRL Press, 1987). 

In cases involving random gene integration, a clone containing the sequence(s) of 
30 the invention is co-transfected with a gene encoding resistance. Alternatively, the gene 

encoding neomycin resistance is physically linked to the sequence(s) of the invention. 



WO 00/73469 



100 



PCT/US00/14842 



Transfection and isolation of desired clones are carried out by any one of several methods 
well known to those of ordinary skill in the art (E.J. Robertson, supra). 

DNA molecules introduced into ES cells can also be integrated into the 
chromosome through the process of homologous recombination (Capecchi, Science 244: 
5 1288-1292, 1989). Methods for positive selection of the recombination event (i.e., neo 

resistance) and dual positive-negative selection (i.e., neo resistance and gancyclovir 
resistance) and the subsequent identification of the desired clones by PCR have been 
described by Capecchi, supra and Joyner et ai (Nature 338: 1 53-1 56, 1 989), the teachings 
of which are incoiporated herein in their entirety including any drawings. The final phase 

1 0 of the procedure is to inject targeted ES cells into blastocysts and to transfer the 

blastocysts into pseudopregnant females. The resulting chimeric animals are bred and the 
offspring are analyzed by Southern blotting to identify individuals that carry the transgene. 
Procedures for the production of non-rodent mammals and other animals have been 
discussed by others (Houdebine and Chourrout, supra; Pursel et ai, Science 244:1281- 

15 1288, 1989; and Simmse/ a/., Bio/Technology 6:179-183, 1988). 

Thus, the invention provides transgenic, nonhuman mammals containing a 
transgene encoding a kinase of the invention or a gene effecting the expression of the 
kinase. Such transgenic nonhuman mammals are particularly useful as an in vivo test 
system for studying the effects of introduction of a kinase, or regulating the expression of 

20 a kinase (i.e., through the introduction of additional genes, antisense nucleic acids, or 

ribozymes). 

A "transgenic animal" is an animal having cells that contain DNA which has been 
artificially inserted into a cell, which DNA becomes part of the genome of the animal 
which develops from that cell. Preferred transgenic animals are primates, mice, rats, 
25 cows, pigs, horses, goats, sheep, dogs and cats. The transgenic DNA may encode human 

STE20-related kinases. Native expression in an animal may be reduced by providing an 
amount of anti-sense RNA or DNA effective to reduce expression of the receptor. 

IX. Gene Therapy 

30 Protein kinases of the invention, or their genetic sequences will also be useful in 

gene therapy (reviewed in Miller, Nature 357:455-460, 1992). Miller states that advances 
have resulted in practical approaches to human gene therapy that have demonstrated 
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positive initial results. The basic science of gene therapy is described in Mulligan 
(Science 260:926-931, 1993). 

In one preferred embodiment, an expression vector containing protein kinase 
coding sequence is inserted into cells, the cells are grown in vitro, and then are infused in 
5 large numbers into patients. In another preferred embodiment, a DNA segment containing 

a promoter of choice (for example a strong promoter) is transferred into cells containing 
an endogenous gene encoding kinases of the invention in such a manner that the promoter 
segment enhances expression of the endogenous kinase gene (for example, the promoter 
segment is transferred to the cell such that it becomes directly linked to the endogenous 
10 kinase gene). 

The gene therapy may involve the use of an adenovirus containing kinase cDNA 
targeted to a tumor, systemic kinase increase by implantation of engineered cells, injection 
with kinase-encoding virus, or injection of naked kinase DNA into appropriate tissues. 

Target cell populations may be modified by introducing altered forms of one or 
1 5 more components of the protein complexes in order to modulate the activity of such 

complexes. For example, by reducing or inhibiting a complex component activity within 
target cells, an abnormal signal transduction event(s) leading to a condition may be 
decreased, inhibited, or reversed. Deletion or missense mutants of a component, that 
retain the ability to interact with other components of the protein complexes but cannot 
20 function in signal transduction may be used to inhibit an abnormal, deleterious signal 

transduction event. 

Expression vectors derived from viruses such as retroviruses, vaccinia virus, 
adenovirus, adeno-associated virus, herpes viruses, several RNA viruses, or bovine 
papilloma virus, may be used for delivery of nucleotide sequences (e.g., cDNA) encoding 

25 recombinant kinase of the invention protein into the targeted cell population {e.g., tumor 

cells). Methods which are well known to those skilled in the art can be used to construct 
recombinant viral vectors containing coding sequences (Maniatis et al, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1989; Ausubel et 
al, Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 

30 lnterscience, N.Y., 1989). Alternatively, recombinant nucleic acid molecules encoding 

protein sequences can be used as naked DNA or in a reconstituted system e.g., liposomes 
or other lipid systems for delivery to target cells (e.g., Feigner et al, Nature 337:387-8, 
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1989). Several other methods for the direct transfer of plasmid DNA into cells exist for 
use in human gene therapy and involve targeting the DNA to receptors on cells by 
complexing the plasmid DNA to proteins (Miller, supra). 

In its simplest form, gene transfer can be performed by simply injecting minute 
5 amounts of DNA into the nucleus of a cell, through a process of microinjection (Capecchi, 

Cell 22:479-88, 1980). Once recombinant genes are introduced into a cell, they can be 
recognized by the cell's normal mechanisms for transcription and translation, and a gene 
product will be expressed. Other methods have also been attempted for introducing DNA 
into larger numbers of cells. These methods include: transfection, wherein DNA is 

10 precipitated with CaP0 4 and taken into cells by pinocytosis (Chen et al., Mol. Cell Biol. 

7:2745-52, 1987); electroporation, wherein cells are exposed to large voltage pulses to 
introduce holes into the membrane (Chu et ai, Nucleic Acids Res. 15:131 1-26, 1987); 
lipofection/liposome fusion, wherein DNA is packaged into lipophilic vesicles which fuse 
with a target cell (Feigner et ai 3 Proc. Natl. Acad. Sci. USA. 84:7413-7417, 1987); and 

15 particle bombardment using DNA bound to small projectiles (Yang et al. 9 Proc. Natl. 

Acad. Sci. 87:9568-9572, 1990). Another method for introducing DNA into cells is to 
couple the DNA to chemically modified proteins. 

It has also been shown that adenovirus proteins are capable of destabilizing 
endosomes and enhancing the uptake of DNA into cells. The admixture of adenovirus to 

20 solutions containing DNA complexes, or the binding of DNA to polylysine covalently 

attached to adenovirus using protein crosslinking agents substantially improves the uptake 
and expression of the recombinant gene (Curiel et ai, Am. J. Respir. Cell. Mol. Biol., 
6:247-52, 1992). 

As used herein "gene transfer" means the process of introducing a foreign nucleic 
25 acid molecule into a cell. Gene transfer is commonly performed to enable the expression 

of a particular product encoded by the gene. The product may include a protein, 
polypeptide, anti-sense DNA or RNA, or enzymatically active RNA. Gene transfer can be 
performed in cultured cells or by direct administration into animals. Generally gene 
transfer involves the process of nucleic acid contact with a target cell by non-specific or 
30 receptor mediated interactions, uptake of nucleic acid into the cell through the membrane 
or by endocytosis, and release of nucleic acid into the cytoplasm from the plasma 
membrane or endosome. Expression may require, in addition, movement of the nucleic 
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acid into the nucleus of the cell and binding to appropriate nuclear factors for 
transcription. 

As used herein "gene therapy" is a form of gene transfer and is included within the 
definition of gene transfer as used herein and specifically refers to gene transfer to express 
a therapeutic product from a cell in vivo or in vitro. Gene transfer can be performed ex 
vivo on cells which are then transplanted into a patient, or can be performed by direct 
administration of the nucleic acid or nucleic acid-protein complex into the patient. 

In another preferred embodiment, a vector having nucleic acid sequences encoding 
a protein kinase polypeptide of the invention is provided in which the nucleic acid 
sequence is expressed only in specific tissue. Methods of achieving tissue-specific gene 
expression are set forth in International Publication No. WO 93/09236, filed November 3, 
1992 and published May 13, 1993. 

In all of the preceding vectors set forth above, a further aspect of the invention is 
that the nucleic acid sequence contained in the vector may include additions, deletions or 
modifications to some or all of the sequence of the nucleic acid, as defined above. 

In another preferred embodiment, a method of gene replacement is set forth. 
"Gene replacement" as used herein means supplying a nucleic acid sequence which is 
capable of being expressed in vivo in an animal and thereby providing or augmenting the 
function of an endogenous gene that is missing or defective in the animal. 
X. Administration of Substances 

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 
WO 96/22976, published August 1 1996, both of which are incorporated herein by 
reference in their entirety, including any drawings, figures, or tables. Those skilled in the 
art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used, and the size and physiological condition of 
the patient. Therapeutically effective doses for the compounds described herein can be 
estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
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takes into account the IC50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

Plasma half-life and biodistribution of the drug and metabolites in the plasma, 
tumors, and major organs can be also be determined to facilitate the selection of drugs 
5 most appropriate to inhibit a disorder. Such measurements can be carried out. For 

example, HPLC analysis can be performed on the plasma of animals treated with the drug 
and the location of radiolabeled compounds can be determined using detection methods 
such as X-ray, CAT scan, and MRI. Compounds that show potent inhibitory activity in 
the screening assays, but have poor pharmacokinetic characteristics, can be optimized by 

10 altering the chemical structure and retesting. In this regard, compounds displaying good 

pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 1) 
the compound is administered to mice (an untreated control mouse should also be used); 

15 2) blood samples are periodically obtained via the tail vein from one mouse in each 

treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition, and the percent of lymphocytes versus polymorphonuclear cells. 
A comparison of results for each dosing regime with the controls indicates if toxicity is 
present. 

20 At the termination of each toxicity study, further studies can be carried out by 

sacrificing the animals (preferably, in accordance with the American Veterinary Medical 
Association guidelines Report of the American Veterinary Medical Assoc. Panel on 
Euthanasia, Journal of American Veterinary Medical Assoc., 202:229-249, 1993). 
Representative animals from each treatment group can then be examined by gross 

25 necropsy for immediate evidence of metastasis, unusual illness, or toxicity. Gross 

abnormalities in tissue are noted, and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse 
effect the less preferred the compound. 
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For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most 
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 

Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

EXAMPLES 

The examples below are not limiting and are merely representative of various 
aspects and features of the present invention. The examples below demonstrate the 
isolation and characterization of the protein kinases of the invention. 

EXAMPLE 1: Isolation of cDNA clones Encoding Novel Mammalian Protein Kinases 
Materials and Methods Identification from cDNA databases and isolation of clones 
encoding novel protein kinases 

Novel kinases were identified from the public EST databases using a Hidden 
Markov model, abbreviated HMM (Krogh, A., Brown, M., Mian, I. S., Sjolander, K., and 
Haussler, D. 1994. Hidden Markov models in computational biology: Applications to 
protein modeling. J. Mol Biol, 235:1501-1531). The model was built with 70 
mammalian and yeast kinase catalytic domain sequences. These sequences were chosen 
from a comprehensive collection of kinases such that no two sequences had more than 
50% sequence identity. ESTs were translated in six open reading frames and were 
searched against the model. ESTs that had a score of at least 10 against the HMM were 
then masked for repetitive sequences and vectors and were clustered using MSA. The 
resulting contigs were searched against known kinases to identify EST clones that encode 
novel kinases. 

Approximately 40% of the ESTs encoding potentially novel kinases did not 
correspond to the correct EST upon sequence analysis. Most of these discrepancies were 
resolved by ordering additional clones, however, 14 remained unavailable. These 14 ESTs 
were amplified from a variety of single-stranded cDNA sources with primers derived from 
the corresponding EST entry as shown on Table 5. The PCR product was subcloned into a 
bluescript vector, digested to confirm the presence of a correct size insert and sequenced. 
Full sequencing of EST and PCR was carried out using a cycle sequencing Big-dye kit 
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with AmpliTaq DNA Polymerase, FS (AB1, Foster City, CA). Sequencing reaction 
products were run on an ABI Prism 377 DNA Sequencer. 

Table 5: Primers used to clone PCR products corresponding to novel kinases 





ID# 


1D# 


Parent 


5' primer 


3* primer 


sp 


na 


aa 


Sequence 


Sequence* 


Sequence* 


H 


33 


153 


2R22-5-1 1 


GAGATCGRNTTYAARGA 
RTTYGA 


TGTCACNCCNAGNSWCCAN 
AYRTT 


M 


81 


200 


5R57_10_2_ 
m TESK2_m 


GCTGCTGGACAGTGACT 
TGTATTT 


GAAAGCAAAGCCTTCACAC 
CTT 


H 


67 


187 


5R69_17_2_h 


CTCTCACCTCAGGAACT 
GG 


GCTTGCGGATCTTCTCA 


H 


46 


166 


SGK309_h 


GACATCCTGCCGGCCAA 
CTACG 


CGGCCCTGGAGCTGCATCA 
CTA 


M 


67 


228 


5R72J6_2_h 


TGCGCGACACCATTGAC 
CAG 


CTCAGGGCTTACATACAGA 
G 


H 


45 


165 


5R72_8_2_h 


AAAGGAGAACTACATTT 
TGAAAAT 


CTTCATCATCTCTAATACAT 
TGGTTGG 


H 


41 


161 


Z36720 


CAAATTAAGATCATTGA 
CTTTGGG 


GGAAACAAAGTCCTTGGCC 
TC 


H 


115 


234 


AL031652- 
Pak6 


GTGGACATCTGGTCCCT 
CG 


GTAGGTCCTTCACTCTTGG 
AG 



• degenerate oligonucleotide residue designation: 
N= A,C,G ot T 



R= A or G 
Y= C or T 
S = C or G 
W= A or T 

Full-length sequence extension of protein kinases using cDNA and genomic databases 

Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was carried out by iterative blastn searching of the cDNA databases listed in Table 
6. All blastn searches were conducted using a blosum62 matrix, a penalty for a nucleotide 
mismatch of -3 and reward for a nucleotide match of 1 . The gapped blast algortihm is 
described in: (Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui 
Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and 
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PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 
25:3389-3402). 

Table 6. Databases used for cDNA-based sequence extensions 



Database 




LifeGold templates 


Feb 2000 


LifeGold compseqs 


Feb 2000 


LifeGold compseqs 


Mar 2000 


LifeGold compseqs 


Apr 2000 


LifeGold fl 


Feb 2000 


LifeGold flft 


Apr 2000 


NCB1 human Ests 


May 2000 


NCBI murine Ests 


May 2000 


NCBI nonredundant 


May 2000 



Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was also carried out by iterative searches of genomic databases. Three methods 
were used. The first method made use of the Smith-Waterman algorithm to carry out 
protein-protein searches of the closest homologue or orthologue to the partial kinase. The 
target databases consisted of Genescan and open-reading frame (ORF) predictions of all 
human genomic sequence derived from the human genome project (HGP) as well as from 
Celera. The complete set of genomic databases searched is shown in Table 7 below. 
Genomic sequences encoding potential extensions were further assessed by blastp analysis 
against the NCBI nonredundant to confirm the novelty of the hit. The extending genomic 
sequences were incorporated into the cDNA sequence after removal of potential introns 
using the Seqman program from DNAStar. The default parameters used for Smith- 
Waterman searches were as shown next. Matrix: blosum 62; gap-opening penalty: 12; gap 
extension penalty: 2. Genescan predictions were made using the Genescan program as 
detailed in (Chris Burge and Sam Karlin "Prediction of Complete Gene Structures in 
Human Genomic DNA", J MB (1997) 268(1 ):78-94). ORF predictions from genomic 
DNA were made using a standard 6-frame translation. 
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The second method for genomic sequence-based extensions made use of tBlastn 
searches of the homologue or orthologue to the partial kinase against the cDNA databases 
listed in Table 7. The recognition of significant hits in these databases made possible to 
identify bridging partial cDNA clones. The iterative application of the two methods made 
possible the assemblage of the virtual full-length sequence for a large number of the 
kinases presented in this application. All tblastn searches were conducted using a 
blosum62 matrix, a penalty for a nucleotide mismatch of -3 and reward for a nucleotide 
match of 1 . 

The last method for defining cDNA extensions from genomic sequence used 
iterative searches of genomic databases through the Genescan program to predict ex on 
splicing and the Genewise program (http://www.sanger.ac.uk/SoftwareAVise2/ ) to predict 
potential ORFs based on homology to the closest orthologue/homologue. 
Table 7. Databases used for genomic-based sequence extensions 



Database 


Number of entries 


Database Date 


Celera v. 1-5 


5,306,158 


Jan 19/00 


Celera v. 6-10 


4,209,980 


Mar 24/00 


Celera v. 11-14 


7,222,425 


Apr 24/00 


Celera v. 15 


243,044 


May 14/00 


HGP all Genescan 


25,885 


Apr 04/00 


HGP; Phase 0 


4,944 


May 04/00 


HGP; Phase 1 


28,478 


May 05/00 


HGP; Phase 2 


1,508 


May 04/00 


HGP; Phase 3 


9,971 


May 05/00 



Virtual Extensions 

Human AA826850 (SEQ ID NO: 3, SEQ ED NO: 124) 
Blastn analysis of the partial AA826850 sequence revealed an extension to 
encompass the complete ORF in the lncyte EST 238299.1 . A frame-shift correction at 
position 595 of this EST (marked by X in NA sequence) generated an uninterrupted ORf . 
Human AA960957 (SEQ ID NO: 4, SEQ ID NO: 125) 
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Since the initial filing of this application, the partial AA960957 sequence appeared 
in the public database as the full-length gene for a protein kinase encoded by a gene that 
maps adjacent to the eve (AJ250839) (ellis-van creveld syndrome and weyers acrodental 
dysostosis) gene from 4pl6.1. 
5 Human 5R79-46-l_h (SEQ ID NO: 5, SEQ ID NO:126) 

Blastn analysis of the partial 5R79-46-1 sequence revealed an extension to 
encompass the complete ORE in the lncyte EST 463894.6. Since the initial filing of this 
application, the full-length virtual 5R79-46-1 appeared in the public database as the full- 
length gene for the TANK-binding kinase (TBK1) (Pomerantz,J.L. and Baltimore,D. 
10 (1999) EMBO J. 18 (23), 6694-6704). TBK1 participates in NF-kB activation through the 

formation of a signaling complex with TRAF2 and TANK. 

Human AA305 1 76 (SEQ ID NO: 6, SEQ ID NO: 1 27) 

Blastn analysis of the partial AA3051 76 sequence revealed an extension to 
encompass the complete ORF in the lncyte EST 220937.1. 
15 Human AA256100 (SEQ ID NO: 8, SEQ ID NO: 129) 

Blastn analysis of the partial AA256100 sequence revealed an extension to 
encompass the complete ORF through the assembly of three partial clones: lncyte EST 
480815.6, KIAA0965 (BAA76809) and AA256100. 

Human AA2 1 0825 (SEQ ID NO: 9, SEQ ID NO: 1 30) 
20 Blastn analysis of the partial AA21 0825 sequence revealed an extension to 

encompass the nearly complete ORF through the assembly of three partial clones: lncyte 
EST 014721.7, and the NCBI EST's AW01 158 and AA210825. An insertion of two "N's" 
at positions 1915 and 1916 generated an uninterrupted ORF. Blastx analysis indicated the 
possibility of a start Met in the range of 400-450 nucleotides (i.e. compared to the closest 
25 homolog, human PKCmu (CAA53384.1). However, no Met was found in this region; 

rather ORF ends in an in-frame stop preceeded by the sequence 

t4 RGLLAPGDPPCPPPNPAPATPPSSRLPTELFSNFCDS ,> . It is possible that part of the 
sequence covered by nucleotide positions 1-400 derived from AW01 158 comes from an 
intron, explaining the absence of a start Met. 
30 Human AA 127299 (SEQ ID NO: 10, SEQ ID NO: 131) 

No entries in the database extended this sequence. The 1684 bp insert of this EST 
contains a 1369 bp intron at the 3' end. Blastx and SW analysis of the 315 bp coding 
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region revealed homology to the extracatalytic C2 domain of PKC. This EST, may or may 
not encode a kinase. 

Human AA3 16804 (SEQ ID NO:l 1, SEQ ID NO: 132) 

Since the initial filing of this application, the partial AA3 16804 sequence appeared 
5 in the public database as the full-length gene for the PKC family protein kinase EPK2 or 

PKCnu (AB015982). 

Human H 1 9 1 02 (SEQ ID NO: 1 4, SEQ ID NO: 1 35) 

Genewise and Genescan analyses of the partial HI 91 02 sequence revealed an 
extension from the HGP phase 3 contig 3810672 to encompass the complete catalytic 

10 domain of this EST. Blastn analysis against the non-redundant database revealed that this 

gene is found in the cosmid AC005726 from chromosome 17. HI 91 02 may encode a dual 
catalytic kinase given the homology to S6 kinase. Analysis of genomic sequence upstream 
of the 5' end of HI 9102 revealed a non-kinase gene oriented in the same polarity as 
HI 9 102 suggestive of the start Met for HI 91 02 being close to the 5' end of the HI 91 02 

15 sequence. From this analysis it is deduced that the second catalytic domain of HI 91 02, if 

present, is most likely located within the 47334-185,215 bp region of the genomic 
sequence of AC005726. 

Human AA476563 (SEQ ID NO: 15, SEQ ID NO: 136) 

Since the initial filing of this application, the partial AA476563 sequence appeared 
20 in the public database as the full-length gene for the protein kinase RPS6KC1 

(NMJH2424) (Zhang, H. et al Genomics (1999) 61, 314-318), which is an S6 kinase 
mapping to 12ql2-ql3.1. 

Human AA626690 (SEQ ID NO: 1 6, SEQ ID NO: 1 37) 

Since the initial filing of this application, the partial AA626690 sequence appeared 
25 in the public database as the full-length gene for the protein kinase RPS6KA6 (AF1 84965) 

(Yntema,H.G et al (1999) Genomics 62, 332-343), an S6 kinase commonly deleted in 
patients with complex X-linked (Xq21.1 ) mental retardation. 
Human A12 1 5680 (SEQ ID NO: 1 7, SEQ ID NO: 1 38) 

Since the initial filing of this application, the partial AI2 15680 sequence appeared 
30 in the public database as the frill-length gene encoding a hypothetical protein (AAD30182) 
from the locus AC006530.4 from chromosome 14. 

Human AA887783 (SEQ ID NO:21, SEQ ID NO: 142) 
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Blastn analysis of the partial AA887783 sequence revealed an extension to 
encompass the nearly complete ORF through the assembly of three partial clones: lncyte 
415390R6 and the NCBI EST's AA887783 and N94726. Since the initial filing of this 
application, the nearly full-length virtual AA887783 sequence appeared in the public 
database as the fiill-length gene encoding SGK3 (AF1 69035), a serum- and 
glucocorticoid-induced protein kinase (Kobayashi,T. et al (1999) Biochemical J. 344, 189- 
197. 

Human R47805 (SEQ ID NO:22, SEQ ID NO: 143) 

A cDNA clone encoding the fiill-length ORF of R47805 was isolated using 
R47805 as a screening probe. A full-length form for R47805 has also appeared in the 
public database as 

PTK9L (NM_007284), an A6-related protein kinase. 

Human H602 1 5 (SEQ ID NO:23, SEQ ID NO: 1 44) 

Blastn analysis of the partial H60215 sequence revealed an extension to encompass 
the complete ORF in the public EST AI275726. This was confirmed through the full insert 
sequencing of this EST (2,310 bp) which corresponds to the sequence under SEQ ID 
NO: 144. 

A different stop codon was predicted for AI275726 compared to H6021 5 due to a 
single nucleotide insertion at position 1586 in AI275726. Evidence for the extra nucleotide 
comes from EST A1191922. 

SGK324_h orthologue of W30246_m (SEQ ID NO:24 , SEQ ED NO: 145) 

Blastn, blastx and Smith-Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding to the human orthologue of 
murine W30246. Exons predicted from the following sequences were used for contig 
construction: Celera 17000189645083, 17000057549105 and 1 1000501939981; 
Incytel42404.1,HGP_7249119, lncyte 71 96489H1, Celera 1 1000501939981, 
17000028165594; lncyte 72491 19^3, Celera 17000035772368, 11000502081575 and 
17000140274329. The latter Celera sequence provides the N-terminus. 

Human AA383293 (SEQ ED NO:26, SEQ ID NO:147) 

Blastn, blastx and Smith- Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding for AA383293. Exons predicted 
from the following sequences were used for contig construction: (numbers in parenthesis 
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refer to the aa sequence of the closest homolog (RU2S, NP_057440) used for the Smith- 
Waterman query): N-term from Incyte 6010175_2 (14-97), Incyte 6981981 (134-184) 
7596749 (186-232) Celera 17000020789545 (243-301) CAB75619.1 (310-341)~(56-145 
DCX homology) 6010175 J2 , Celera 17000030058129 (241-262 DCX homology). 
5 Human AA021445 (SEQ ID NO:32, SEQ ID NO: 152) 

Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding for AA021445. Contig reconstruction was as follows: nucleotides 1-802 
from KIAA0999 (AB023216); nucleotides 803-4321 from full-insert sequence of 
AA021445. A pairwise alignment between the AA021445 and K1AA0999 revealed three 
10 inserts in the extracatalytic C-terminus of 48, 48 and 161 aminoacids. In addition, both 

AA021445 and KIAA0999 have 15 copies of a CAG repeat. Trinucleotide repeats are 
often found in genes that linked to neurodegenerative diseases. 
Human 2R22-55-1 (SEQ ID NO:33, SEQ ID NO: 153) 
Blastn analysis revealed an extension in the Incyte EST clone 321074.1 to 
1 5 encompass the complete ORF corresponding to 2R22-55-1 . 

Human orthologue of AA544838_m (SEQ ID NO:36, SEQ ID NO: 156) 
tBlastn analysis identified the partial human K1AA0135 (U79240) clone as the 
human orthologue of murine AA544838. Blastn revealed an extension KIAA0135_h 
(U79240) to encompass the complete ORE. The full ORF was reconstructed from 
20 lncyte406786.5, KFZp430051 and KIAA0135 (U79240). 

Human orthologue of AI785735_m (SEQ ID NO:38, SEQ ED NO: 1 58) 
tBlastn analysis identified the partial human KIAA0781 (AB018324) clone as the 
human orthologue of murine AI785735. Blastn revealed an extension KIAA01 35_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from Incyte 
25 986123.37 KIAA0781 (AB018324). 

Human AA207220 (SEQ ID NO: 39, SEQ ID NO: 159) 
Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding for AA021445. The frill ORF was reconstructed from Incyte 402740.1 and 
AA207220. Frame corrections: deletion of 441 and 595 over Inc402740.1 seq based on 
30 blastx to keep frame open; two n insertions 940, 941 over AA207220 to keep frame open. 

Human AA426580 (SEQ ID NO:40, SEQ ID NO: 160) 
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Since the initial filing of this application, the partial AA426580 sequence appeared 
in the public database as the full-length gene encoding MAK-V (AJ271722) from 
chromosome 21 q22.1. 

Human 5R79-54-1 (SEQ ID NO: 41, SEQ ID NO: 161) 
5 Genewise and Genescan analyses of the partial 5R79-54-1 sequence revealed an 

extension from genomic sequence to encode the full ORF for 5R79-54-1 . 

Human orthologue of AA542015_m (SEQ ID NO: 42, SEQ ID NO: 162) 

tBlastn analysis identified KIAA1297 (AB037718). Blastn extended the 
KIAA1297 sequence to provide the C-terminus through the Incyte 224074.1 EST. The 
10 partial ORF consists of a dual catalytic domain flanked by 6 Ig domains and 2 fibronectin 

repeats. Based on homology to the bt drosophila protein (AAF59316.1), the human form 
of AA542015 is expected to be missing 16 Ig domains. 

Human R19772 (SEQ ID NO:44, SEQ ID NO: 164) 

The full-length ORF for R19772 was isolated by screening a cDNA library using a 
15 probe derived from R19772. Since the initial filing of this application, the R19772 

sequence appeared in the public database as the full-length gene encoding Trio (Duet) 
(AB01 1422). CDNA library screening revealed multiple isoforms for this gene which are 
summarized in the Table below. 

20 Table 8. Isoforms for R 197 72 



Kestrl Name 


Kestrl 
AA Acc # 


Isoform 
type 


Source 


Description* 


Trad (Duet) 


R19772 


B 


Skeletal 
muscle 


Deletion of K at 124 










Deletion ofQat616 










Substitution of E for G at 
762 






C 


Skeletal 
muscle 


Deletion ofK at 124 










Deletion of Q at 616 










Substitution of E for G at 
762 



WO 00/73469 



114 



PCT/US00/14842 











Deletion of 32 aa (160-191) 






D 


Lung tumor 


Deletion of Q at 616 










Deletion of 32 aa (160-191) 






E 


Lung tumor 


Deletion ofQ at 616 










Deletion of 32 aa (160-191) 



* reference amino acid position are with respect to sequence of Trad (AB01 1422) 



Human AA435956 (SEQ ID NO:48, SEQ ID NO: 168) 
5 Blastn analysis revealed an extension to encompass the nearly complete catalytic 

region of AA435956. 5 5 end sequence extension was provided by genomic locus 
AC007242.3_h (range 44880-43801). Based on blastx analysis, the extended sequence 
encodes is full-length at the C-terminus. 

Human AA397553 (SEQ ID NO: 5 1 , SEQ ID NO: 1 7 1 ) 
10 Since the initial filing of this application, the partial AA397553 sequence appeared 

in the public database as the full-length gene encoding CRK7 (AF227198), a novel CDC2- 
related protein kinase that colocalizes with interchromatin granule clusters. 

Human AA789239 (SEQ ID NO: 52, SEQ ID NO: 172) 

Since the initial filing of this application, the partial AA789239 sequence appeared 
1 5 in the public database as the full-length gene encoding NKIAMRE (AF1 30372), a novel 

kinase deleted in human leukemia. 

Human AA631990 (SEQ ID NO:55, SEQ ID NO:175) 
Blastn analysis revealed an extension to encompass the full-length ORF for 
AA631990. The full ORF was reconstructed from 253847.5 and AA631990 and 
20 AA207220. Frame corrections: delete 1 C at 1380, delete 2N's at 2033/2034. 
Human AA557536 (SEQ ID NO:56, SEQ ID NO:176) 
Blastn analysis revealed an extension to encompass full-length ORF for 
AA557536. The full ORF was reconstructed from AA557536, celera 1 1000504061899 
and the Incyte 097089.1 EST. An 85bp intron was removed from AA557536. 
25 Human N34132 (SEQ ID NO: 63, SEQ ID NO:183) 

Full sequencing of EST N34132 (1 .3 kb) confirmed that this cDNA encodes a 
novel NEK-subfamily kinase. Blast analysis against the EST database showed that four 
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EST sequences (AA283140, AA283140, AA28291 1 and N5301 1) extended the sequence 
of N34132 at the 3' end to form a 2.31 kb contig. Blast analysis of the new contig against 
the nonredunat public database showed that the N34132 extended contig overlapped 
(100% identity) over 228 bp at its 3' end with human K1AA0344 (AB002342), a 5, 787 bp 
5 cDNA encoding a 1 246 aa polypeptide. The 5' 790 bp of the K1AA0344 cDNA (encoding 

the 58 N-terminal protein sequence) were found to be divergent with respect to the 
extended 2.32 kb N34132 contig. Evidence that the extended N34132 contig (2.3 lkb) and 
K1AA0344 (AB002342) belong to the same gene is the following. First, blast analysis of 
the nucleotide sequences for N34132 and KIAA0344 against the NRN database confirmed 

10 that these cDNA's are transcribed from the same genomic locus defined by two 

overlapping BACs (AC004765 and AC004803) from chromosome 12pl3.3. Second, full 
sequence determination of a PCR fragment amplified from single-stranded cDNA 
confirmed the junction between the extended N34132 contig and KlAA0344_h 
(AB002342). The 462 PCR product was amplified with primers 

1 5 CTCCTCAACAG ACAGTGCAG (5 5 primer) and G ACATTCTACTACTCGGTCTC (3 ' 

primer) designed from the N34132 extended contig and K1AA0344 sequences, 
respectively. The region of N34132 containing the start Met was isolated by PCR from a 
testis cDNA library (Clontech). 

Human 5R69-1 7-2 (SEQ ID NO:67, SEQ ID NO: 187) 

20 The full-length ORF for 5R69-1 7-2 was isolated by screening a cDNA library 

using a probe derived from 5R69-1 7-2. 

Human H85811 (SEQ ID NO:68, SEQ ID NO: 188) 

Tblastn, Smith-Waterman and blastn analyses using cDNA databases revealed an 
extension to encompass full-length ORF for H8581 1 . The full ORF was reconstructed 
25 from IncyteESTs 202971.8, 034583.3 and 034583.1 and public ESTs H8581 1 and 

AI570599. 

Human R43524 (SEQ ID NO:73, SEQ ID NO:192) 

Blastn analysis revealed an extension to encompass the complete catalytic region 
and the C-terminus of R43524. Since the initial filing of this application, the partial 
30 R43524 sequence appeared in the public database as the full-length gene encoding the 
heme-regulated initiation factor 2-alpha kinase (HRI) (AF181071). 

Human AA088547 (SEQ ID NO:78, SEQ ID NO: 1 97) 
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Genewise and Genescan analyses of genomic databases revealed an extension to 
encompass the complete ORF for AA088547. 

Human orthologue of AA139478_m (SEQ ID NO:80, SEQ ID NO: 199) 
Tblastn identified the Incyte 21 1475.1 as the potential full-length human 
5 orthologue of murine AA 1 39478 

Human AA232253 (SEQ ID NO:82, SEQ ID NO:201) 

The full-length ORF for AA232253 was isolated by screening a cDNA library 
using a probe derived from AA232253. Since the initial filing of this application, the 
AA232253 sequence appeared in the public database as the full-length gene encoding SLK 
10 (AB01 1422). SLK is a stress-regulated mixed lineage kinase-like protein that activation 

of Rac and induction of apoptosis. cDNA library screening revealed multiple isoforms for 
this gene which are summarized in the Table below. 

Table 9. Isofonns for AA232253 

15 



Kestrl 
Name 


Kestrl AA 
Acc# 


Isoform 
type 


Description* 


MLK4 


AA232253 


MLK4 


Substitution of C for W at 346 






MLK4B 


Different Cterm (332-800); seq in MLK4B is as 
shown in * 



* C-terminus specific to MLK4B 

LPLAARMSEESYFESKTEESNSAEMSCQITATSNGEGHGMNPSLQAMMLMGFGD1 
FSMNKAGAVMHSGMQINMQAKQNSS 
20 KTTSKKJIGKKVNMALGFSDFDLSEGDDDDDDDGEEEDNDMDNSE 

Human H97685 (SEQ ID NO:84, SEQ ID NO:203) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
H97685. The full ORF was reconstructed from Incyte 474824.1 and the public ESTs 
25 H97685 andM62021. 

Human AI052250 (SEQ ID NO:87, SEQ ID NO:206) 
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Blastn analysis revealed an extension to encompass the full-length ORF for 
AJ052250. The full ORF was reconstructed from lncyte 396868.1, the public partial 
cDNA FLJ 10074 (minus intron) and the public ESTs and the public ESTs AI052250 and 
H97685, AI499220 and M62021. 
5 Human AA278842 (SEQ ID NO:88, SEQ ID NO:206) 

A nearly full-length cDNA (FL4F12) for AA278842 was isolated by screening a 
cDNA library using a probe derived from AA278842. A full-length virtual ORF was 
generated using FL4F12 and AA278842. 

Human AA599286 (SEQ ID NO:89, SEQ ID NO:208) 
10 Since the initial filing of this application, the partial AA599286 sequence appeared 

in the public database as a full-length ORF (AK000342). 

Human AA425725 (SEQ ID NO:90, SEQ ID NO:209) 

Since the initial filing of this application, the partial AA425725 sequence appeared 
in the public database as MSSK1, a serine kinase gene located from human chromosome 
15 Xq28. 

Human SGK022 orthologue of AA060026_m (SEQ ID NO:91, SEQ ID NO:210) 

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases 
databases revealed a potential human orthologue for murine AA060026. The full-length 
ORF for SGK022 was reconstructed from genomic locus AC022307. 
20 Human AA399669 (SEQ ID NO:93, SEQ ID NO:212) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
AA399669. The full ORF was reconstructed as follows: sequence 1-1007 from 
AL136295.2; sequencel 008-23 19 from AA399669 and lncyte 428177.1. 

Human AA883975 (SEQ ID NO:95 3 SEQ ID NO:214) 
25 Genescan and Genewise analyses of the genomic databases revealed an extension 

for AA883975 to encompass the full-length ORF 

Human AA905446 (SEQ ID NO:96, SEQ ID NO:215) 

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases 
databases revealed an extension for AA905446 to encompass the full-length ORF. For the 
30 Smith-Waterman analysis murine STK22 ( NP_033462) was used as the closest 

orthologue. Contig formation: range 162133-163687 from HGPJi 6921333_9; removed 
intron (146-893) predicted from blastx analysis. 
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Human H29974 (SEQ ID MO: 97 SEQ ID NO:216) 

Blastn analysis revealed an extension to encompass a complete catalytic ORF for 
AA399669. The nearly full-length ORF was reconstructed using Incyte 213829.1 and 
H29974. 

5 Human AA2 153 11 (SEQ ED NO:99, SEQ ID NO:218) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
AA21531 . The full ORF was reconstructed from Incyte 067584.1, 022456.1, AA21531 1 
and the reverse complement of CPG 043208. 

Human AA018361 (SEQ ID NO:100, SEQ ID NO:219) 
10 The full-length ORF for AA01 8361 was isolated by screening a cDNA library 

using a probe derived from AA018361 . This yielded clone Sug4-30. Clone Sug4-30, like 
multiple, independent cDNA clones contained a 181 bp intron. The existence of intron-less 
RNA's was confirmed by a PCR reaction that generated a product that upon sequence 
analysis skipped the intron region. The full-length virtual. ORF for AA01 8361 was 
1 5 generated through a contig between AL1 1 7482 (seq 1 -367) and the sequence for clone 

Sug4-30. 

Human orthologue of AA396601_rn (SEQ ID NO:106, SEQ ID NO:225) 
tBlastn and Smith -Waterman analyses of genomic sequence revealed an extension 
to encompass the full catalytic region for the human orthologue of AA396601. The ORF 
20 was reconstructed from Incyte 018653.9 (7261449H1, 6891740J1) and genomic sequence 

CPG_040010. 

Human orthologue of AA671275_m (SEQ ID NO:108, SEQ ID NO:227) 
Since the initial filing of this application, a potential human orthologue for murine 
AA671275 appeared in the public database as the full-length ORF for vaccinia related 
25 kinase 3 (BAA90769). 

Human H05721 (SEQ ID NO: 1 1 1, SEQ ID NO:230) 

Genescan and Genewise analyses of genomic sequence revealed an extension to 
encompass the full-length ORF for H05721 . 

Human AI086865 (SEQ ID NO:l 12, SEQ ID NO:231) 
30 Genescan and Genewise analyses of genomic sequence revealed an extension to 

encompass the full-length ORF for AJ086865. The full-length ORF was reconstructed 
from Celera 17000102901516, Incyte 243269.1 and public AL1 377531. 
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Human AA836348 (SEQ ID NO: 1 1 3, SEQ ID NO:232) 

Genescan and Genewise analyses of genomic sequence revealed an extension to 
encompass the full-length ORF for AA836348. 

Human R86668 (SEQ ID NO: 14, SEQ ID NO:233) 

The full-length ORF for R86668 was isolated by screening a cDNA library using a 
probe derived from R86668. Since the initial filing of this application, the R8668 sequence 
appeared in the public database as the full-length gene mitogen-activated protein kinase 
kinase kinase 6 (MAP3K6) (NM_00467). 

Human 2R4 1-9-4 (SEQ ID NO: 16, SEQ ID NO:235) 

The full-length virtual ORF for 2R4 1-9-4 was generated using genomic sequence 
to provide the Nterminus for the partial ORF predicted from clone 2R41-9-4 

Table 10. Sequences deleted from the provisional patent due to duplication with other 
genes in the patent 



Prov. SEQ ID NO: (na) 


Prov. SEQ ID NO: (aa) 


160 


196 


213 


214 


215 


216 


122 


126 


119 


123 


148 


184 


4 


20 


7 


23 


205 


206 


14 


30 


15 


31 


35 


56 


42 


63 


51 


72 


44 


65 


77 


91 
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120 



78 


92 


79 


93 


80 


94 


157 


193 



Results 

Table 1 documents the results from the analysis of the nucleic acid sequence data. 
From left to right the data presented is as follows. "Gene name" refers to the EST or PCR 
5 fragment that defined the novel kinase. "Species" refers to the organism the sequence was 

derived from. "ID#" refers to the nucleic acid and amino acid sequence ID number 
designation from this patent. "Kinase family "and "Kinase group" refers to the protein 
kinase classification defined by sequence homology and based on previously established 
phylogenetic analysis [Hardie, G. and Hanks S. The Protein Kinase Book, Academic Press 

10 (1995) and Hunter T. and Plowman, G. Trends in Biochemical Sciences (1977) 22:18-22 

and Plowman G.D. et ah (1999) Proc. Natl. Acad. Sci. 96:13603-13610)]. "ORF Start", 
"ORF End", "ORF Length" refer to the open reading frame range and length as calculated 
by standard nucleic acid translation programs such as MapDraw (DNAStar). "DNA 
Repeats" refers to regions of low complexity sequence or repetitive elements such as Alu, 

15 LINE, SINE, and LTR sequences. The chromosomal location (CHR localization) for 37 

of the 110 novel protein kinases is shown on Table 1 (NA, not available). The methods 
for determining chromosomal position are outlined below, in Example 2. 

Table 2 documents the results from the analysis of the amino acid sequence data. 
From left to right the data presented is as follows. "Gene name" refers to the EST or PCR 

20 fragment that defined the novel kinase. "Species" refers to the organism the sequence was 

derived from. "E)#" refers to the nucleic acid and amino acid sequence ID number 
designation from this patent. "Kinase family "and "Kinase group" refers to the protein 
kinase classification defined by sequence homology and based on previously established 
phylogenetic analysis [Hardie, G. and Hanks S. The Protein Kinase Book, Academic Press 

25 (1995) and Hunter T. and Plowman, G. Trends in Biochemical Sciences (1977) 22:1 8-22 

and Plowman G.D. et al (1999) Proc. Natl. Acad. Sci. 96:13603-13610)]. "nraa Score", 
"ID match aa", "Identity", "Similar", "nraa Match Acc#", Description" refer to the data 
obtained using a Smith- Waterman search of the amino acid sequence against the non- 
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redundant protein database (Matrix: PamlOO; gap open/extension penalties 14/1). "Kinase 
Domain Start" "Kinase Domain End", "Profile Start" and "Profile End" refer to data 
obtained using a Hidden-Markov Model to define catalytic range boundaries. The profile 
has a length of 261 amino acids, corresponding to the complete protein kinase catalytic 
5 domain. Proteins in which the profile recognizes a full length catalytic domain have a 

"Profile Start" of 1 and a "Profile End" of 261 . The boundaries of the catalytic domain 
within the overall protein are noted in the "Kinase Domain Start" and "Kinase Domain 
End" columns. 

1 0 The following abbreviations were used for kinases: 



ASK 


Apoptosis signal-regulating kinase 


CaMK 


Ca2+/calmodulin-dependent protein kinase 


CCRK 


Cell cycle-related kinase 


CDK 


Cyclin-dependent kinase 


CK 


Casein kinase 


DAPK 


Death-associated protein kinase 


DM 


myotonic dystrophy kinase 


Dyrk 


dual-specificity-tyrosine phosphorylating-regulated kinase 


GAK 


Cyclin G-associated kinase 


GRK 


G-protein coupled receptor 


GuC 


Guanylate cyclase 


HffK 


Homeodomain-interacting protein 


IRAK 


lnterleukin-1 receptor-associated kin 


MAPK 


Mitogen activated protein kinase 


MAST 


Micotubule-associated STK 


MLCK 


Myosin-light chain kinase 


MLK 


Mixed lineage kinase 


NMA 


NimA-related protein kinase 


PKA 


cAMP-dependent protein kinase 


RSK 


Ribosomal protein S6 kinase 


RTK 


Receptor tyrosine kinase 
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SGK Serum and glucocorticoid-regulated kinase 
STK serine threonine kinase 
ULK UNC-5 1 -like kinase 

The following abbreviations were used for species 



H 


Human 


M 


Murine 


R 


Rat 


FV 


Fowlpox virus 


MT 


M. thermoautotrophicurn 


CE 


Caenorhabditis elegans 


DM 


Drosophila melanogaster 


OS 


Oryza sativa 


SP 


Schizosaccharomyces pombe 


TP 


Tetrahymena pyriformis 


PI 


Petunia inflata 


NC 


Neurospora crassa 


MSV 


Medic ago sativa 


MSV 


Moloney murine sarcoma virus 


SA 


Squalus acanthias 


CS 


Cucumis sativus 


GM 


Glycine max 


LL 


Lilium longiflorum 


TV 


Trichomonas vaginalis 


MP 


Mycoplasma pneumoniae 


DD 


Dictyostelium discoideum 


SC 


Saccharomyces cerevisiae 


MT 


M ethanobacteri um thermoautotrophi cum 
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Domain and Motif Identification 

A Hidden Markov model (HMM) (Krogh, A., Brown, M., Mian, I. S., Sjolander, 

K., and Haussler, D. (1 994). Hidden Markov models in computational biology: 

Applications to protein modeling. J. Mol. BioL, 235:1501-1531) was used to identify, 
5 both catalytic and extracatalytic domains. Table 4 shows extra-catalytic domains that 

were identified using the HMM program. Other domains such as coiled-coil and pest 

motifs were identified as described next. 

Potential coiled-coil domains were identified using the COILS program 

(www.ch.embnet.org/software/COILS_form.html). The matrix used was MTDDK with 
10 windows of 14, 21, 28 amino acids. Only regions scoring 0.5 or higher were considered to 

have potential coiled-coil domain region. 

Protein sequences containing potential pest motifs were identified using the 

program PESTfmd (www.at.embnet.org/embnet/tools/bio/PESTfind/). PEST regions in 

proteins are by definition sequences that tend to be rich in proline, glutamic or aspartic 
1 5 acid, argininine and histidine; they have been associated with increased protein turnover 

rates (Rogers S. et al (1986) Science 234, 364-368. The algorithm defines PEST 

sequences as hydrophilic stretches of amino acids greater than or equal to 12 residues in 

length. Such regions contain at least one P, one E or D and one S or T. They are flanked 

by lysine (K), arginine (R) or histidine (H) residues, but positively charged residues are 
20 disallowed within the PEST sequence. PESTfmd produces a score ranging form about -50 

to +50. By definition, a score above zero denotes a possible PEST region; a value greater 

than +5 defines a high probability that there is a PEST domain. 

Identification of potential coiled-coil domains and PEST domains in N34132 

Potential coiled-coil domains were identified in N34132 (SEQ ID NO: 183) using 
25 the COILS program. Only regions scoring 0.5 or higher were considered to have potential 

coiled-coil domain region. The amino acid positions within N34231 scoring for potential 

coil-coil regions are shown below. 
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Table 1 1 coiled-coil domains predicted for N34132 



Coiled-coil Region 


Amino acid range 


Length (aa) 


1 


124-147 


24 


1 2 


437-451 


15 


3 


495-526 


32 


4 


1,723-1,749 


27 



Potential PEST domains were identified in N34132 using PESTfind, a value 
greater than +5 defines a high probability that there is a PEST domain. The amino acid 
positions within N34132 scoring for potential PEST regions are shown below. 



Table 12 Potential Pest domains identified in N34132 



PEST Region 


Score 


Amino acid range 


Amino Acid Length 


1 


+ 4.91 


54-95 


42 


2 


+11.4 


537-570 


34 


3 


+31.08 


1293-1304 


12 


4 


+10.15 


1543-1565 


23 


5 


+ 6.17 


1698-1732 


35 



EXAMPLE 2. Chromosomal Localization of Novel Mammalian Protein Kinases 
Materials and Methods 

Several sources were used to find information about the chromosomal localization 
of each of the genes described in this patent. First, the accession number for the nucleic 
acid sequence was used to query the Unigene database. The site containing the Unigene 
search engine is: http://www.ncbi.nlm.nih.gov/UniGene/Hs.Home.html. Information on 
map position within the Unigene database is imported from several sources, including the 
Online Mendelian Inheritance in Man (OMIM, 

http://www.ncbi.nlm. nih.gov/Omim/searchomim. html), The Genome Database 
(http://gdb.infobiogen.fr/gdb/simpleSearch.html), and the Whitehead Institute human 
physical map (http://carbon.wi.mit.edu:8000/cgi-bin/contig/stsjnfo?database=release). 
For example, searching Unigene with W56561, an EST for a MAK-like kinase, the 
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following information is retrieved: Chr.14, D14S65-qTEL. The location of this gene on 
an "ideogram" of the cytogenetic map of chromosome 14 is also provided, showing that 
W56561 maps to the bottom of chromosome 14, between 14q31 and 14qTel. If Unigene 
has not mapped the EST, then the nucleic acid for the gene of interest is used as a query 
5 against databases, such as dbsts and htgs (described at 

http://www.ncbi.nlm.nih. gov/BLAST/blast_databases.html) containing sequences that 
have been mapped already. The nucleic acid sequence is searched using BLAST-2 at 
NCB1 (http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast) and is used to query 
either dbsts or htgs. In addition to the Whitehead and GDB sites mentioned above, 

10 Stanford University maintains a useful site for chromosomal mapping from STS data 

(http:/Mww-shgc.stanford.edu/RH/rhsei^erfonnnew.html). Matches in htgs are often 
resolved immediately because the genomic region hit is annotated in the htgs entry. If an 
exact match match is found (defined roughly as 99% identity over a region of about 100 
base pairs or longer, excluding any repetitive sequence), then the mapped position of the 

1 5 entry in the database is assigned to the original kinase query. Once a cytogenetic region 

has been identified by one of these approaches, disease association is established by 
searching OMIM (see above for URL) with the cytogenetic location. OMIM maintains a 
searchable catalog of cytogenetic map locations organized by disease. A thorough search 
of available literature for the cytogenetic region is alo made using Medline 

20 (http://www.ncbi.nlm.nih.gov/PubMed/medline.html). References for association of the 

mapped sites with chromosomal abnormalities found in human cancer can be found in: 
Knuutila, et ai, Am J Pathol, 1998, 152:1107-1123. 

Results 

25 The chromosomal location for 37 of the 1 10 novel protein kinases is shown on 

Table 1 . Three of the novel protein kinases were mapped to regions associated with 
cancer amplicons, as shown on this table. The regions were also cross-checked with the 
Mendelian Inheritance in Man database, which tracks genetic information for many human 
diseases, including cancer. References for association of the mapped sites with 

30 chromosomal abnormalities found in human cancer can be found in: Knuutila, et al., Am J 

Pathol, 1998, 152:1107-1123. Association of these mapped regions with other diseases is 
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documented in the Online Mendelian Inheritance in Man (OMIM) 
(http://www.ncbi.nlm.nih.gov/htbin-post/Omim). 

EXAMPLE 3: Generation of Specific Immunoreagents 
Materials and Methods 

Peptide sequences to extra-catalytic regions of novel kinases are chosen which are 
not homologous to other known kinases based on a Smith Waterman homology search 
against the non-redundant protein database and predicted to be antigenic based on the 
DNAStar Protean program. These peptides are conjugated to KLH using Glutaraldehyde. 

Rabbits are immunized with the KLH-peptide conjugates by four injections three 
weeks apart. The rabbits are bled ten and fourteen days following the third injection and 
bled out ten days after the fourth. The serum is checked against the peptide by EL1SA. 



Table 13. Peptides to be used as immunogens for raising antibodies 



Clone 


SEQ ID 


Peptide Sequence 


Amino Location 


Name 


NO (aa) 






AA8256850 


124 


KSRDNSRDSSQSEND 


339-353 






TEKLKRS QDLPREPLP 


372-386 






RGWRPYDIHS 


223-232 


5R79-46-1 


126 


FEGPPvRNKJEVMYK 


224-236 






KDDYNETVHKKTE 


451-463 






GTHPKDRNVEKLQ 


541-553 






EVSKYQEYTNELQET 


643-657 


AA256100 


129 


1DDTSNFDDFPESD1 


405-419 






TEPDYKSKDWVFL 
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EXAMPLE 4. Expression analysis of Novel Mammalian Protein Kinases 
GENE EXPRESSION ANALYSIS 
Tissue Arrays 

"cDNA libraries" derived from a variety of sources were immobilized onto nylon 
5 membranes and probed with 32P-labeled cDNA fragments derived from the gene(s) of 

interest. 

Total RNA or mRNA was used as template in a reverse transcription reaction to 
generate single-stranded cDNAs (ss cDNA) that were tagged with specific sequences at 
each end. An oligo dT primer containing a specific sequence (CDS: 

10 AAGCAGTGGTAACAACGCAGAGTACT30VN (V=A,G,C N=A,G,C,T)) anneals at 
the polyA track at the 3' end of the mRNA and the reverse transcriptase (MMLV 
RnaseH-) transcribes the antisense strand until it reaches the end of the RNA strand when 
it adds additional C residues. If a primer (SMII: 
AAGCAGTGGTAACAACGCAGAGTACGCGGG or ML2G: 

15 AAGTGGCAACAGAGATAACGCGTACGCGGG) ending with 3 Gs is added, it anneals 

to the added Cs and the MMLV recognizes the rest of the primer sequence as template and 
continues transcription. As a result, the synthesized cDNAs contain specific sequence tags 
at both the 5 s and the 3' end. When the 5' and the 3 5 ends are tagged with the same 
sequence (CDS and SMII) it is referred to as "symmetric." When the 5' end is tagged 

20 with a different sequence than the 3' end (CDS and ML2G) is referred to as "asymmetric" 
A double-stranded "cDNA library 11 is then generated by PCR amplification using the 
3'PCR and ML2 primers (3' PCR: AAGCAGTGGTAACAACGCAGAGT and ML2: 
AAGTGGCAACAGAGATAACGCGT) that anneal to the added sequence tags. 

The amplified "cDNA libraries" were manually arrayed onto nylon membranes 

25 with a 384 pin replicator. The DNA was denatured by alkali treatment, neutralized and 

cross-linked by UV light. The arrays were pre-hybridized with Express Hyb (Clontech) 
and hybridized with 32P labeled probes generated by random hexamer priming of cDNA 
fragments corresponding to the genes of interest. After washing, the blots were exposed to 
phosphorimaging cassettes and the intensity of the signal was quantified. The amount of 

30 the DNA on the arrays was also quantified by treating non-denatured or denatured arrays 
with Syber Green I or Syber Green II respectively (1 :1 00,000 in 50mM Tris, pH8.0) for 2 
minutes. After washing with 50mM Tris, pH8.0, the fluorescent emission was detected 
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with a phosphorimager (Molecular Dynamics) and quantified. The amount of the arrayed 
DNA was used to normalize the hybridization signal and the corrected values are tabulated 
in Table 3. 

5 Results 

The results of the microarray expression analysis of the protein kinases presented 
in this application is shown in Table 3. Data presentation from left to right is as follows: 
"Tissue": tissue type of the cDNA; "Tumor sym", indicates that the tissue is derived from 
a tumor, "sym" refers to the fact that the 5' and 3' primers used to make the sample are the 

] 0 same; "Normal Sym", indicates normal tissue was used to make the sample, with 

symmetric primers as described above; "Tumor lo", indicates that primary tumor tissue 
was used to make the cDNA; "Tumor cells", indicates that these cDNA samples were 
made from cultured tumor cells; "Normal", indicates that these samples are derived from 
normal tissue or cell lines; "Endos", indicates that these samples are derived from 

15 endothelium-related tissue sources; "p53" refers to the status, mutant or wild-type, of the 
p53 gene in the source samples. Normalized expression values are presented for each 
gene referred to by its SEQ ED# on the subsequent columns. Genes represented in 
expression Table 3 are: SEQ ID NO:3 (AA826850), SEQ ED NO:5 (TBK1), SEQ ID NO:6 
(AA305176), SEQ ID NO:8 (AA256100), SEQ ID NO:9 (CAB43292), SEQ ID NO:l 1 

20 (EPK2), SEQ ID NO:12 (PKNbeta), SEQ IDNO:14 (H19102), SEQ ID NO:16 (RSK4), 

SEQ ID NO: 1 7 (AAD301 82), SEQ ID NO:20 (SGK2), SEQ ID NO:22 (PTK9L), SEQ ID 
NO:26 (AA383293), SEQ ID NO:29 (DRAK2), SEQ ID NO:31 (DRAK1), SEQ ID 
NO:032 (AA015726), SEQ ID NO:40 (MAK-V), SEQ ID NO:044 (TRAD), SEQ ID 
NO:044 (TRAD), SEQ ID NO:45 (AA454060), SEQ ID NO:47 (AA234451), SEQ ID 

25 NO:48 (AA436054), SEQ ID NO:49 (AA626859), SEQ ID NO:5 1 (KIAA0904), SEQ ID 

NO:52 (AA789239), SEQ ID NO:54 (CCRK), SEQ ID NO:55 (CLK4), SEQ ID NO:56 
(AA557536), SEQ ID NO:57 (W56561), SEQ ID NO:60 (AA579641), SEQ ID NO:63 
(NEK7), SEQ ID NO:66 (CAMKKB), SEQ ID NO:68 (HIPK2), SEQ ID NO:72 
(R19609), SEQ ID NO:73 (HRI), SEQ ID NO:78 (AA088547), SEQ ID NO:79 

30 (AA449542), SEQ ED NO:082a (MLK4), SEQ ED NO:82 (MLK4b), SEQ ID NO:84 
(RDM), SEQ ID NO:88 (AA278842), SEQ ED NO:89 (AA1 95964), SEQ ID NO:90 
(MSSK1), SEQ ED NO:93 (TSK4), SEQ ID NO:94 (AI025291), SEQ ED NO:95 
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(AA948538), SEQ ID NO:96 (AA905446), SEQ ID NO:97 (H85389), SEQ ID NO: 100 
(AAO 18361), SEQ ID NO: 101 (AA31 1714), SEQ ID NO:l 10 (AA452647), SEQ ID 
NO:l 1 1 (AA310219), SEQ ID NO: 1 12 (AI086865), SEQ ID NO:l 14 (MEKK6), and SEQ 
IDNO:116(SuRTK106). 

5 

EXAMPLE 5. Kinase assays for Erk, JNK1 and p38 MAP kinases 

293T cells were transiently transfected with HA- p38 or co-transfected with Flag- 
tagged wt MLK4A, kinase-dead MLK4A, wild-type MLK4B or kinase-dead MLK4B 
using Lipofectamine 2000 (Lifetech). Cells were lysed 36 hr post-transfection. Cell 

10 lysates normalized to contain equivalent amounts of HA-p38 were immunoprecipitated 

with anti-HA antibody (Mab HA-1 1, Babco). bnmunoprecipitates were split in two 
portions, one portion was Western-blotted with anti- HA antibody and the other with a 
phospho-specific p38 antibody (Promega) to detect activated levels of p38. Activation of 
Erkl and Jnkl was measured similarly. (This example applies to AA232253 (SEQ ID 

15 NO:82,SEQIDNO:201).) 

Results : 

In transient assays wild-type MLK4A and MLK4B (but not kinase-inactive 
MLK4A(K45M) or MLK4B(K45M)) activate Erk, JNK1 and p38 MAP kinases. 
20 EXAMPLE 6. RAC1 guanine-exchange factor assay 

293T cells were transiently transfected with HA-Racl or co-transfected with Flag- 
tagged Duet C, Duet E, Dbl and HA-Tiam-1 . Cells were lysed 36 hour post-transfection. 
Cell lysates normalized to contain equivalent amounts of Racl were affinity precipitated 
with immobilized GST-PBD (p21 -binding domain of Pak3). Bound proteins were 
25 Western blotted and probed with anti-HA antibody to detect levels of activated Racl . 

((This example applies to R199772 (Trad/Duet)(SEQ ID NO:44, SEQ ID NO: 164).) 



Results : 

Duet C and Duet E both act as guanine nucleotide exchange factors on Racl . 
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CONCLUSION 

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
5 those inherent therein. The molecular complexes and the methods, procedures, treatments, 

molecules, specific compounds described herein are presently representative of preferred 
embodiments are exemplary and are not intended as limitations on the scope of the 
invention. Changes therein and other uses will occur to those skilled in the art which are 
encompassed within the spirit of the invention are defined by the scope of the claims. 

10 It will be readily apparent to one skilled in the art that varying substitutions and 

modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. 

All patents and publications mentioned in the specification are indicative of the 
levels of those skilled in the art to which the invention pertains. 

15 The invention illustratively described herein suitably may be practiced in the 

absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms 
"comprising", "consisting essentially of and "consisting of 5 may be replaced with either 
of the other two terms. The terms and expressions which have been employed are used as 

20 terms of description and not of limitation, and there is no intention that in the use of such 

terms and expressions of excluding any equivalents of the features shown and described or 
portions thereof, but it is recognized that various modifications are possible within the 
scope of the invention claimed. 

In particular, although some formulations described herein have been identified by 

25 the excipients added to the formulations, the invention is meant to also cover the final 

formulation formed by the combination of these excipients. Specifically, the invention 
includes formulations in which one to all of the added excipients undergo a reaction 
during formulation and are no longer present in the final formulation, or are present in 
modified forms. 

30 In addition, where features or aspects of the invention are described in terms of 

Markush groups, those skilled in the art will recognize that the invention is also thereby 
described in terms of any individual member or subgroup of members of the Markush 
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group. For example, if X is described as selected from the group consisting of bromine, 
chlorine, and iodine, claims for X being bromine and claims for X being bromine and 
chlorine are fully described. 

Other embodiments are within the following claims. 
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What is claimed is: 

CLAIMS 

1. An isolated, enriched, or purified nucleic acid molecule encoding a kinase 
polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ 
5 ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ 

ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ 
ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 137, SEQ ID NO: 1 38, SEQ 
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ 
ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ED NO:147, SEQ ID NO:148, SEQ 

10 ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ 

ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ED NO: 158, SEQ 
ED NO:159, SEQ ED NO:160, SEQ ED NO:161, SEQ ED NO:162, SEQ ED NO:163, SEQ 
ID NO: 164, SEQ ED NO: 165. SEQ ED NO:166, SEQ ID NO: 167, SEQ ED NO: 168, SEQ 
ID NO: 1 69, SEQ ED NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ 

15 ID NO: 174, SEQ ED NO: 175, SEQ ID NO:176, SEQ ID NO: 177, SEQ ED NO:178, SEQ 

ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO:l 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ 
ID NO: 1 84, SEQ ED NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ 
ED NO:189, SEQ ED NO:190, SEQ EDNO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ 
ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ 

20 ID NO: 199, SEQ ED NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ 
ED NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ 
ED NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ED NO:212, SEQ ID NO:213, SEQ 
ED NO:214, SEQ ED NO:215, SEQ IDNO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ 
ED NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ 

25 ED NO:224, SEQ ED NO:225, SEQ ID NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ 
ID NO:229, SEQ ED NO:230, SEQ ID NO:231, SEQ ED NO:232, SEQ ED NO:233, SEQ 
ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ ED NO:237, SEQ ID NO:238, SEQ 
ID NO:239, SEQ ED NO:240, SEQ ID NO:241, and SEQ ED NO:242. 
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2. The nucleic acid molecule of claim 1 , wherein said nucleic acid molecule 
comprises a nucleotide sequence that: 

(a) encodes a polypeptide comprising the amino acid sequence set forth 



in SEQ ED NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID 
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(b) is the complement of the nucleotide sequence of (a); 
30 (c) hybridizes under highly stringent conditions to the nucleotide 

molecule of (a) and encodes a naturally occurring kinase polypeptide; 
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(d) encodes a kinase polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
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ID 


NO:227, 


SEQ 


ID 


NO:228, 


SEQ 


ID 


NO:229, 




SEQ 


ED 


NO:230, 


SEQEDNO:231, 


SEQ 


ID 


NO:232, 


SEQ 


ED 


NO:233, 


SEQ 


ID 


NO:234, 


25 


SEQ 


ID 


NO:235, 


SEQ ED NO:236, 


SEQ 


ID 


NO:237, 


SEQ 


ID 


N0.238, 


SEQ 


ID 


NO:239, 



SEQ ID NO:240, SEQ ID NO:24l , and SEQ ID NO:242, except that it lacks one or more, 
but not all, of a domain selected from the group consisting of an N-lerminal domain, a 
catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region, 
a spacer region, an insert, and a C-terminal tail; 
30 (e) is the complement of the nucleotide sequence of (d); 
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(f) encodes a domain of an amino acid sequence selected from the 
group set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, 
SEQ ID NO: 1 26, SEQ ED NO: 1 27, SEQ ID NO: 128, SEQ ID NO: 1 29, SEQ ID NO : 1 30, 
SEQ ID NO:131, SEQ ID NO:132, SEQ IDNO:133, SEQ ID NO:134, SEQ ID NO:135, 
SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, 
SEQ ED NO:141, SEQ ID NO:142, SEQ IDNO:143, SEQ ID NO:144, SEQ ID NO:145, 
SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 148, SEQ ID NO: 1 49, SEQ ID NO: 1 50, 
SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, 
SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, 
SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. 
SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ED NO: 1 68, SEQ ED NO: 1 69, SEQ ED NO: 1 70, 
SEQ ID NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ED NO: 1 73, SEQ ED NO:l 74, SEQ ED NO: 1 75, 
SEQ ID NO: 1 76, SEQ ED NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO:l 79, SEQ ED NO: 1 80, 
SEQ ED NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, 
SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, 
SEQ ID NO: 1 91 , SEQ ED NO: 1 99, SEQ ED NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, 
SEQ ED NO: 196, SEQ ED NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID NO:200, 
SEQ ID NO:201 , SEQ ED NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, 
SEQ ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ED NO:209, SEQ ED NO:210, 
SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ED NO:2 1 6, SEQ ID NO:2 1 7, SEQ ED NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, 
SEQ ID NO:221, SEQ ED NO:222, SEQ ID NO:223, SEQ ED NO:224, SEQ ID NO:225, 
SEQ ID NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ID NO:230, 
SEQ ID NO:231, SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID NO:235, 
SEQ ID NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, 
SEQ ID NO:241, and SEQ ED NO:242, wherein said domain is selected from the group 
consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil 
structure region, a proline-rich region, a spacer region, an insert, and a C-terminal tail; 

(g) is the complement of the nucleotide sequence of (f); 

(h) encodes a polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO: 122, SEQ ED NO: 123, SEQ ID NO: 124, SEQ ID 
NO:125, SEQ ED NO:126, SEQ ED NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID 
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NO:130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID 
NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ED NO: 139, SEQ ID 
NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ED NO: 143, SEQ ID NO: 144, SEQ ID 
NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID 

5 NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID 

NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ED NO: 159, SEQ ID 
NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID 
NO:165. SEQ ID NO:166, SEQ ID NO:167, SEQ ID.NO:168, SEQ ID NO:169, SEQ ID 
NO: 170, SEQ ID NO: 171, SEQ ED NO: 172, SEQ ID NO: 173, SEQ ED NO: 174, SEQ ID 

10 NO: 1 75, SEQ ED NO:l 76, SEQ ED NO: 1 77, SEQ ED NO: 1 78, SEQ ID NO: 1 79, SEQ ID 

NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ED NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID 
NO: 185, SEQ ID NO: 186, SEQ ED NO: 187, SEQ ED NO: 188, SEQ ID NO: 189, SEQ ID 
NO:190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ED NO: 193, SEQ ID NO: 194, SEQ ID 
NO: 195, SEQ ID NO: 196, SEQ ED NO: 197, SEQ ED NO: 198, SEQ ID NO: 199, SEQ ID 

1 5 NO:200, SEQ ED NO:20 1 , SEQ ED NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID 
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID 
NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED 
NO:215, SEQ ED NO:21 6, SEQ ED NO:21 7, SEQ ID NO:21 8, SEQ ED NO:219, SEQ ID 
NO:220, SEQ ID NO:221, SEQ ED NO:222, SEQ ED NO:223, SEQ ED NO:224, SEQ ID 

20 NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID 

NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID 
NO:240, SEQ ID NO:241, and SEQ ID NO:242, except that it lacks one or more, but not 
all, of the domains selected from the group consisting of an N-terminal domain, a catalytic 

25 domain, a C-terminal domain, a spacer region, a proline-rich region, a coiled-coil structure 
region, and a C-terminal tail; or 

(i) is the complement of the nucleotide sequence of(h). 
3. The nucleic acid molecule of claim 1 , further comprising a vector or 
promoter effective to initiate transcription in a host cell. 



30 
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4. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule is 
isolated, enriched, or purified from a mammal. 

5. The nucleic acid molecule of claim 4, wherein said mammal is a human. 

6. A nucleic acid probe for the detection of nucleic acid encoding a kinase 

5 polypeptide in a sample, wherein said polypeptide is selected from the group consisting of 
SEQ ID NO:122, SEQ ED NO:123, SEQ ED NO:124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ID NO: 127, SEQ ED NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, 
SEQ ID NO: 132, SEQ ED NO: 133, SEQ DO NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, 
SEQ ED NO: 137, SEQ ED NO: 138, SEQ ED NO: 139, SEQ ED NO: 140, SEQ ID NO: 141, 

10 SEQ ED NO:142, SEQ ED NO:143, SEQ ED NO:144, SEQ ID NO:145, SEQ ID NO: 146,*' 

SEQ ED NO:147, SEQ ED NO:148, SEQ ED NO:149, SEQ ID NO:150, SEQ ID NO:151, 
SEQ ID NO: 152, SEQ ED NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, 
SEQ ED NO: 157, SEQ ED NO: 158, SEQ ED NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, 
SEQ ED NO: 162, SEQ ED NO: 163, SEQ ED NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, 

15 SEQ ED NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, 

SEQ ID NO: 172, SEQ ED NO: 173, SEQ ED NO: 174, SEQ ED NO: 175, SEQ ID NO: 176, 
SEQ ID NO: 177, SEQ ED NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, 
SEQ ED NO: 182, SEQ ED NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, 
SEQ ED NO: 1 87, SEQ ED NO: 1 88, SEQ ED NO: 1 89, SEQ ED NO: 1 90, SEQ ID NO: 1 91 , 

20 SEQ ED NO: 199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, 
SEQ ED NO: 197, SEQ ED NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ED NO:201, 
SEQ ED NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ED NO:205, SEQ ED NO:206, 
SEQ ED NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ED NO:210, SEQ ED NO:21 1, 
SEQ ED NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ID NO:216, 

25 SEQ ID NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ID NO:221, 
SEQ ID NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID NO:226, 
SEQ ED NO:227, SEQ ED NO:228, SEQ ED NO:229, SEQ ED NO:230, SEQ ID NO:231, 
SEQ ID NO:232, SEQ ID NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ED NO:236, 
SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ED NO:240, SEQ ID NO:241, 

30 and SEQ ED NO:242. 
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7. The probe of claim 6, wherein said polypeptide is a fragment of the protein 
encoded by an amino acid sequence selected from the group consisting of SEQ ID 
NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
5 NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO : 1 40, SEQ ID NO: 1 4 1 , SEQ ID 
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID 
NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID 

10 NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID 
NO: 162, SEQ ID NO:163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID 
NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID 

15 NO: 1 82, SEQ ID NO: 1 83 , SEQ ID NO: 1 84, SEQ ID NO : 1 85, SEQ ID NO: 1 86, SEQ ID 
NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO: 199, SEQ ED NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

20 NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 

NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 

25 NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 

NO:237, SEQ LD NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
EDNO:242. 
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8. A recombinant cell comprising a nucleic acid molecule encoding a kinase 
polypeptide selected from the group consisting of SEQ ED NO:122, SEQ ED NO:123, SEQ 
ID NO: 124, SEQ ID NO: 125, SEQ ID NO:126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ 
ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ED NO: 1 32, SEQ ED NO: 1 33, SEQ 
5 ED NO: 134, SEQ ED NO: 135, SEQ ED NO:136, SEQ ED NO: 137, SEQ ED NO: 138, SEQ 
ED NO: 139, SEQ ID NO: 140, SEQ ED NO: 141, SEQ ED NO: 142, SEQ ED NO: 143, SEQ 
ED NO: 1 44, SEQ ED NO: 1 45, SEQ ED NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ 
ID NO: 149, SEQ ED NO: 150, SEQ ID NO:151, SEQ ED NO: 152, SEQ ED NO: 153, SEQ 
ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ 

10 ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ 
ID NO: 1 64, SEQ ED NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ 
ID NO: 1 69, SEQ ED NO: 1 70, SEQ ID NO: 1 71 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ 
ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ 
ID NO: 179, SEQ ED NO: 180, SEQ ED NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ 

15 ID NO: 184, SEQ ED NO: 185, SEQ ID NO: 186, SEQ ED NO:l 87, SEQ ID NO: 188, SEQ 
ID NO: 189, SEQ ED NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ED NO:193, SEQ 
ID NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, SEQ ID NO: 197, SEQ ED NO: 198, SEQ 
ED NO: 199, SEQ ID NO:200, SEQ ED NO:201, SEQ ED NO:202, SEQ ID NO:203, SEQ 
ID NO:204, SEQ ED NO:205, SEQ ED NO:206, SEQ ID NO:207, SEQ ED NO:208, SEQ 

20 ID NO:209, SEQ ED NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ED NO:213, SEQ 

ID NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ 
ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ED NO:228, SEQ 
ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ 

25 ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ 

ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ED NO:242. 
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9. The cell of claim 8, wherein said polypeptide is a fragment of a protein 
encoded by an amino acid sequence selected from the group consisting of SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
5 NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID 
NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID 
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:l 51, SEQ ID 
NO:l 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 155, SEQ ID NO: 1 56, SEQ ID 

10 NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ IDNO:161, SEQ ID 
NO:l 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID 
NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID 
NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID 

15 NO:182, SEQ IDNO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID 

NO:187, SEQ ID NO:188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID 
NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 

20 NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ IDN0.21 1, SEQ ID 

NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ED NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 

25 NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 

NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242. 
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10. An isolated, enriched, or purified kinase polypeptide selected from the 
group consisting of SEQ ED NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, 
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, 
SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33 , SEQ ID NO: 1 34, SEQ ID NO: 1 35, 
SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, 
SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, 
SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, 
SEQ ED NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, 
SEQ ID NO: 156, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, 
SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. 
SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO. 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, 
SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, 
SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, 
SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, 
SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, 
SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, 
SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, 
SEQ ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, 
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, 
SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, 
SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, 
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, 
SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, 
SEQ ID NO:236, SEQ ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, 
SEQ ID NO:241, and SEQ ID NO:242. 
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1 1 . The polypeptide of claim 10, wherein said polypeptide is a fragment of the 
protein encoded by an amino acid sequence selected from the group consisting of SEQ ID 





NO: 122, 


SEQ 


ID NO: 123, 


SEQ 


ID 


NO: 124, 


SEQ 


ID 


NO:125, 


SEQ 


ID 


NO: 126, SEQ 


ID 




NO:127, 


SEQ 


ID NO: 128, 


SEQ 


ID 


NO: 129, 


SEQ 


ID 


NO:130, 


SEQ 


ED 


NO:131,SEQ 


ID 


5 


NO:132, 


SEQ 


ID NO: 133, 


SEQ 


ID 


NO:134, 


SEQ 


ID 


NO:135, 


SEQ 


ID 


NO: 136, SEQ 


ID 




NO: 137, 


SEQ 


ID NO: 138, 


SEQ 


ID 


NO:139, 


SEQ 


ID 


NO: 140, 


SEQ 


ID 


NO: 141, SEQ 


ID 




NO: 142, 


SEQ 


IDNO:143, 


SEQ 


ID 


NO: 144, 


SEQ 


ID 


NO:145, 


SEQ 


ID 


NO: 146, SEQ 


ID 




NO:147, 


SEQ 


ID NO: 148, 


SEQ 


ID 


NO:149, 


SEQ 


ID 


NO:150, 


SEQ 


ID 


NO:151,SEQ 


ID 




NO:152, 


SEQ 


IDNO:153, 


SEQ 


ID 


NO: 154, 


SEQ 


ID 


NO:155, 


SEQ 


ID 


NO: 156, SEQ 


ID 


10 


NO:157, 


SEQ 


ID NO: 158, 


SEQ 


ID 


NO:159, 


SEQ 


ID 


NO: 160, 


SEQ 


ID 


NO: 161, SEQ 


ID 




NO:162, 


SEQ 


ID NO: 163, 


SEQ 


ID 


NO: 164, 


SEQ 


ID 


NO:165. 


SEQ 


ID 


NO: 166, SEQ 


ID 




NO: 167, 


SEQ 


ID NO: 168, 


SEQ 


ID 


NO: 169, 


SEQ 


ID 


NO: 170, 


SEQ 


ID 


NO: 171, SEQ 


ID 




NO: 172, 


SEQ 


ID NO: 173, 


SEQ 


ID 


NO:174, 


SEQ 


ID 


NO: 175, 


SEQ 


ID 


NO: 176, SEQ 


ID 




NO: 177, 


SEQ 


ID NO: 178, 


SEQ 


ID 


NO: 179, 


SEQ 


ID 


NO: 180, 


SEQ 


ID 


NO:181,SEQ 


ID 


15 


NO: 182, 


SEQ 


ID NO: 183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO: 185, 


SEQ 


ID 


NO: 186, SEQ 


ID 




NO:187, 


SEQ 


ID NO: 188, 


SEQ 


ID 


NO: 189, 


SEQ 


ID 


NO: 190, 


SEQ 


ID 


NO:191,SEQ 


ID 




NO: 199, 


SEQ 


ID NO: 193, 


SEQ 


ED 


NO: 194, 


SEQ 


ID 


NO: 195, 


SEQ 


ID 


NO: 196, SEQ 


ID 




NO: 197, 


SEQ 


ID NO: 198, 


SEQ 


ID 


NO: 199, 


SEQ 


ID 


NO:200, 


SEQ 


ID 


NO:201,SEQ 


ID 




NO:202, 


SEQ 


ID NO:203, 


SEQ 


ID 


NO:204, 


SEQ 


ID 


NO:205, 


SEQ 


ID 


NO:206, SEQ 


ID 


20 


NO:207, 


SEQ 


ID NO:208, 


SEQ 


ID 


NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


ED 


NO:211,SEQ 


ID 




NO:212, 


SEQ 


IDNO:213, 


SEQ 


ID 


NO:214, 


SEQ 


ED 


NO:215, 


SEQ 


ID 


NO:216, SEQ 


ID 




NO:217, 


SEQ 


IDNO:218, 


SEQ 


ID 


NO:219, 


SEQ 


ID 


NO:220, 


SEQ 


ID 


NO:221,SEQ 


ID 




NO:222, 


SEQ 


ID NO:223, 


SEQ 


ID 


NO:224, 


SEQ 


ID 


NO:225, 


SEQ 


ID 


NO:226, SEQ 


ID 




NO:227, 


SEQ 


ID NO:228, 


SEQ 


ID 


NO:229, 


SEQ 


ID 


NO:230, 


SEQ 


ID 


NO:231,SEQ 


ID 


25 


NO:232, 


SEQ 


IDNO:233, 


SEQ 


ID 


NO:234, 


SEQ 


ED 


NO:235, 


SEQ 


ID 


NO:236, SEQ 


ID 




NO:237, 


SEQ 


ID NO:238, 


SEQ 


ID 


NO:239, 


SEQ 


ED 


NO:240, 


SEQ 


ID 


NO:241,and SEQ 



ID NO:242. 

1 2. The polypeptide of claim 1 0, wherein said polypeptide comprises: 

(a) an amino acid sequence selected from the group consisting of SEQ 
30 ID NO:122, SEQ IDNO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ 
ID NO: 1 27, SEQ ID NO: 128, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ED NO: 1 3 1 , SEQ 
ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ 
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ID NO: 137, SEQ ID NO:138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ 
ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ 
ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ 
ID NO: 152, SEQ ID NO:153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ 
ID NO: 157, SEQ ED NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ 
ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ 
ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ 
ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ 
ID NO: 1 77, SEQ BD NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ 
ID NO: 1 82, SEQ ID NO:l 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ED NO: 1 86, SEQ 
ID NO: 187, SEQ ED NO:l 88, SEQ ED NO: 189, SEQ ED NO: 190, SEQ ED NO: 191, SEQ 
ID NO: 199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ED NO: 195, SEQ ED NO: 196, SEQ 
ID NO: 197, SEQ ED NO: 198, SEQ ED NO: 199, SEQ ED NO:200, SEQ ED NO:201, SEQ 
ID NO:202, SEQ ED NO:203, SEQ ID NO:204, SEQ ED NO:205, SEQ ID NO.206, SEQ 
ID NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1 , SEQ 
ID NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:215, SEQ ID NO:216, SEQ 
ID NO:217, SEQ ED NO:218, SEQ IDNO:219, SEQ ED NO:220, SEQ ID N0.221, SEQ 
ID NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ID N0:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231 , SEQ 
ID NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ED NO:235, SEQ ID NO:236, SEQ 
ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , and 
SEQ ED NO:242; 

(b) an amino acid sequence selected from the group consisting of SEQ 
ID NO: 122, SEQ ED NO: 123, SEQ ID NO: 124, SEQ ED NO: 125, SEQ ED NO: 126, SEQ 
ID NO: 1 27, SEQ ED NO: 1 28, SEQ ED NO: 1 29, SEQ ED NO: 1 30, SEQ ID NO: 1 3 1 , SEQ 
ED NO: 132, SEQ ED NO: 133, SEQ ED NO: 134, SEQ ED NO:135, SEQ ED NO: 136, SEQ 
ED NO: 137, SEQ ED NO: 138, SEQ ED NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ 
ID NO: 142, SEQ ED NO: 143, SEQ ED NO: 144, SEQ ED NO: 145, SEQ ID NO: 146, SEQ 
ED NO: 1 47, SEQ ED NO: 1 48, SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ 
ID NO: 152, SEQ ED NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ 
ID NO: 1 57, SEQ ED NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 61 , SEQ 
ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ 
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ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ 
ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ 
ID NO: 177, SEQ ED NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ 
ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ 
ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ 
ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ 
ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO.200, SEQ ID NO:201 , SEQ 
ID NO:202, SEQ ID NO:203, SEQ ED NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ 
ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ 
ID NO:2 1 7, SEQ ID NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ 
ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ 
ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ED NO:236, SEQ 
ID NO:237, SEQ ED NO:238, SEQ ID NO:239, SEQ ED NO:240, SEQ ED NO:241 , and 
SEQ ID NO:242, except that it lacks one or more, but not all of the domains selected from 
the group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a 
spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail 
(c) a domain of an amino acid sequence selected from the group set 
forth in SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID 
NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID 
NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID 
NO:136, SEQ ID NO:137, SEQ ED NO:138, SEQ ID NO:139, SEQ ID NO.140, SEQ ID 
NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID 
NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID 
NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID 
NO: 156, SEQ ID NO: 157, SEQ ED NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID 
NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID 
NO: 1 66, SEQ ID NO: 1 67, SEQ ED NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID 
NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ED NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID 
NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ED NO: 1 80, SEQ ID 
NO:181 , SEQ ID NO:l 82, SEQ ED NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO:l 85, SEQ ID 
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NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID 
NO:191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID 
NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ED NO: 199, SEQ ID NO.200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 
5 NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID 
NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ED NO:215, SEQ ID 
NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ED NO:220, SEQ ID 
NO:221, SEQ ID NO:222, SEQ ED NO:223, SEQ ID NO:224, SEQ ED NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ED NO:230, SEQ ID 

10 NO:231, SEQ ED N0.232, SEQ ED NO:233, SEQ ED N0.234, SEQ ED NO:235, SEQ ID 
NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ED NO:242 wherein said domain is selected from the group consisting 
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a 
proline-rich region, a coiled-coil structure region, and a C-terminal tail. 

]5 13. The kinase polypeptide of claim 10, wherein said polypeptide is isolated, 

purified, or enriched from a mammal. 

14. The kinase polypeptide of claim 13, wherein said mammal is a human. 

15. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA144574, AA1 16841, AA256100, AA305176, AA210825, AA316804, AA980090, 

20 N42050, AA476563, AA626690, AA960957, HI 91 02, AA045601, AA107515, 
AA1 09508 or AA887783 polypeptide. 

16. The kinase polypeptide of claim 10, wherein said polypeptide is a H60215, 
AA 197883, AA297313, W30246, AA1 72300, AA383293, AA542015, H01 248, N23936, 
W44160, 2R22-5-1 1, 5R72-18-1, AA021445, AA207220, AA426580, AA544838, 

25 W90839, 5R79-54-1 , AA839940, Rl 9772 or 5R72-8-2 polypeptide. 

1 7. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 
AA234451 polypeptide. 

1 8. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 5R65- 1 6- 
1, AA061797, AA065538, AA124976, AA397553, AA435956, AA575635, AA626859, 

30 AA789239, AI086865, H 1 7727, H29974, AA557536 or N28606 polypeptide. 

19. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 
AA63 1 990 or W08549 polypeptide. 
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20. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R72-16- 
2, R19927 or R43524 polypeptide. 

21 . The kinase polypeptide of claim 10, wherein said polypeptide is a 5R57-10- 
2 polypeptide. 

22. The kinase polypeptide of claim 10, wherein said polypeptide is a 
AA232253 polypeptide. 

23. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 
AA430250, AA836348, R86668 or N34132 polypeptide. 

24. The kinase polypeptide of claim 1 0, wherein said polypeptide is a 
AA098024or SuRTK106 polypeptide. 

25. The kinase polypeptide of claim 10, wherein said polypeptide is a R47805, 
AA099102, AA589241, H8581 1, AA013524, AA452647, AA840598, AA088547, 
AA139478, AA826850, R87679, W65887, H97685, W20810, AA599286, AA425725, 
AA103218, AA71 1829, AA060026, AA399669, AA758539, AA883975, AA948538, 
AA018361, AA215311, AA311714, AA498104, 5R69-17-2, 5R69-23-3, 5R69-26-2, 
AA 11 8352, AA396601, AA671275, AA278842, AA460132 or H05721 polypeptide. 
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26. An antibody or antibody fragment having specific binding affinity to a 
kinase polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ID 
NO:123, SEQ ID NO: 124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID 
NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID 

5 NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 135, SEQ ID NO: 1 36, SEQ ID NO: 137, SEQ ID 
NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 1 4 1 , SEQ ID NO: 1 42, SEQ ID 
NO:143, SEQ IDNO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID 
NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID 
NO: 153, SEQ ID NO: 1 54, SEQ ID NO: 155, SEQ ID NO: 1 56, SEQ ID NO: 157, SEQ ID 

10 NO:158, SEQ IDNO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID 
NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID 
NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 171, SEQ ID NO: 1 72, SEQ ID 
NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID 
NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID 

15 NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID 

NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO:l 90, SEQ ID NO: 191 , SEQ ID NO: 1 99, SEQ ID 
NO: 1 93, SEQ ID NO: 1 94, SEQ ID NO: 1 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID 
NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 

20 NO:208, SEQ ID NO:209, SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID NO:2 12, SEQ ID 
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:2 1 8, SEQ ID NO:2 1 9, SEQ ID NO:220, SEQ ID NO:22 1 , SEQ ID NO:222, SEQ ID 
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 

25 NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 

NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. 

27. The antibody or antibody fragment of claim 26, wherein said polypeptide 
comprises: 

(a) an amino acid sequence selected from the group consisting of SEQ 
30 ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ 
ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131 , SEQ 
ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 13 5, SEQ ID NO: 136, SEQ 
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ID 


NO:137, 


SEQ 


ID NO: 138, 


SEQ 


ID 


NO: 139, 


SEQ 


ID 


NO: 140, 


SEQ 


ID NO:141, 


SEQ 


ID 


NO: 142, 


SEQ 


IDNO:143, 


SEQ 


ID 


NO: 144, 


SEQ 


ID 


NO: 145, 


SEQ 


ID NO: 146, 


SEQ 


ID 


NO:147, 


SEQ 


ID NO: 148, 


SEQ 


ID 


NO:149, 


SEQ 


ID 


NO:150, 


SEQ 


IDNO:151, 


SEQ 


ID 


NO:152, 


SEQ 


IDNO:153, 


SEQ 


ID 


NO:154, 


SEQ 


ID 


NO:155, 


SEQ 


ID NO: 156, 


SEQ 


ID 


NO:157, 


SEQ 


1DN0:158, 


SEQ 


ID 


NO: 159, 


SEQ 


ID 


NO: 160, 


SEQ 


ID NO:161, 


SEQ 


ID 


NO: 162, 


SEQ 


ID NO: 163, 


SEQ 


ID 


NO: 164, 


SEQ 


ID 


NO:165. 


SEQ 


ID NO: 166, 


SEQ 


ID 


NO:167, 


SEQ 


ID NO: 168, 


SEQ 


ID 


NO: 169, 


SEQ 


ID 


NO: 170, 


SEQ 


IDNO:171, 


SEQ 


ID 


NO: 172, 


SEQ 


ID NO: 173, 


SEQ 


ID 


NO:174, 


SEQ 


ID 


NO:175, 


SEQ 


ID NO: 176, 


SEQ 


ID 


NO: 177, 


SEQ 


ID NO: 178, 


SEQ 


ID 


NO: 179, 


SEQ 


ID 


NO: 180, 


SEQ 


IDNO:181, 


SEQ 


ID 


NO: 182, 


SEQ 


ID NO: 183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO:185, 


SEQ 


rDNO:186, 


SEQ 


ID 


NO: 187, 


SEQ 


ID NO: 188, 


SEQ 


ID 


NO: 189, 


SEQ 


ID 


NO: 190, 


SEQ 


IDNO:191, 


SEQ 


ID 


NO: 199, 


SEQ 


ID NO: 193, 


SEQ 


ID 


NO: 194, 


SEQ 


ID 


NO: 195, 


SEQ 


ID NO: 196, 


SEQ 


ID 


NO: 197, 


SEQ 


ID NO: 198, 


SEQ 


ID 


NO: 199, 


SEQ 


ID 


NO:200, 


SEQ 


ID NO:201, 


SEQ 


ID 


NO:202, 


SEQ 


IDNO:203, 


SEQ 


ID 


NO:204, 


SEQ 


ID 


NO:205, 


SEQ 


ID NO:206, 


SEQ 


ID 


NO:207, 


SEQ 


ID NO:208, 


SEQ 


ID 


NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


IDNO:211, 


SEQ 


ID 


NO:212, 


SEQ 


IDN0.213, 


SEQ 


ID 


NO:214, 


SEQ 


ID 


NO:215, 


SEQ 


IDNO:216, 


SEQ 


ID 


NO:217, 


SEQ 


IDNO:218, 


SEQ 


ID 


NO:219, 


SEQ 


ID 


NO:220, 


SEQ 


ID NO:221, 


SEQ 


ID 


NO:222, 


SEQ 


IDNO:223, 


SEQ 


ID 


NO:224, 


SEQ 


ID 


NO:225, 


SEQ 


IDNO:226, 


SEQ 


ED 


NO:227, 


SEQ 


BDNO:228, 


SEQ 


ID 


NO:229, 


SEQ 


ID 


NO:230, 


SEQ 


IDNO:231, 


SEQ 


ID 


NO:232, 


SEQ 


IDNO:233, 


SEQ 


ID 


NO:234, 


SEQ 


ID 


NO:235, 


SEQ 


ID NO:236, 


SEQ 


ID 


NO:237, 


SEQ 


IDNO:238, 


SEQ 


ID 


NO:239, 


SEQ 


ID 


NO.240, 


SEQ 


IDNO:241, 


and 



SEQ ID NO:242; 

(b) an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ 

25 ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ 

ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ 
ID NO: 137, SEQ ED NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ ID NO: 141, SEQ 
ID NO: 142, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ 
ID NO: 1 47, SEQ ID NO: 1 48, SEQ ED NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 1 5 1 , SEQ 

30 ID NO: 152, SEQ ID NO:l 53, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO:l 56, SEQ 
IDNO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ 
ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165. SEQ ID NO: 166, SEQ 
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ID NO: 1 67, SEQ ED NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 71 , SEQ 
ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ 
H) NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ 
ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ 
ID NO: 187, SEQ DD NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ 
ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ 
ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ 
ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ 
ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ 
ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ 
ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ 
ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ 
ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and 
SEQ ID NO:242, except that it lacks one or more, but not all, of the domains selected from 
the group consisting of a C-terminal domain, a catalytic domain, an N-terminal domain, a 
spacer region, a proline-rich region, a coiled-coil structure region, and a C-termina] tail. 

(c) a domain of an amino acid sequence selected from the group set 
forth in SEQ ED NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID 



NO: 126, 


SEQ 


ID 


NO:127, 


SEQ ID NO: 128, 


SEQ 


ED 


NO:129, 


SEQ 


DD 


NO: 130, 


SEQ 


ID 


NO:131, 


SEQ 


ID 


NO:132, 


SEQIDNO:133, 


SEQ 


ID 


NO:134, 


SEQ 


ID 


NO:135, 


SEQ 


ID 


NO: 136, 


SEQ 


ID 


NO: 137, 


SEQIDNO:138, 


SEQ 


ID 


NO: 139, 


SEQ 


ID 


NO: 140, 


SEQ 


ID 


NO:141, 


SEQ 


ID 


NO: 142, 


SEQ ED NO: 143, 


SEQ 


ID 


NO: 144, 


SEQ 


DD 


NO: 145, 


SEQ 


ED 


NO: 146, 


SEQ 


ID 


NO:147, 


SEQ ID NO: 148, 


SEQ 


ID 


NO: 149, 


SEQ 


DD 


NO:150, 


SEQ 


ID 


NO:151, 


SEQ 


ID 


NO:152, 


SEQIDNO:153, 


SEQ 


DD 


NO:154, 


SEQ 


ED 


NO:155, 


SEQ 


ED 


NO: 156, 


SEQ 


ID 


NO:157, 


SEQ ID NO: 158, 


SEQ 


DD 


NO: 159, 


SEQ 


ID 


NO: 160, 


SEQ 


DD 


NO:161, 


SEQ 


ID 


NO: 162, 


SEQ ID NO: 163, 


SEQ 


DD 


NO: 164, 


SEQ 


ED 


NO:165. 


SEQ 


DD 


NO: 166, 


SEQ 


ID 


NO: 167, 


SEQ ID NO: 168, 


SEQ 


DD 


NO: 169, 


SEQ 


ED 


NO: 170, 


SEQ 


ID 


NO:171, 


SEQ 


ID 


NO:172, 


SEQ ID NO: 173, 


SEQ 


DD 


NO: 174, 


SEQ 


ID 


NO:175, 


SEQ 


ID 


NO: 176, 


SEQ 


ID 


NO:177, 


SEQIDNO:178, 


SEQ 


ID 


NO: 179, 


SEQ 


ID 


NO: 180, 


SEQ 


ID 


NO:181, 


SEQ 


ID 


NO: 182, 


SEQIDNO:183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO:185, 


SEQ 


ID 
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NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID 
NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID 
NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, SEQ ID NO:200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID 
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID 
NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID 
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID 
NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID 
NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID 
NO:241, and SEQ ID NO:242 wherein said domain is selected from the group consisting 
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a 
proline-rich region, a coiled-coil structure region, and a C-terminal tail. 
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28. A hybridoma which produces an antibody having specific binding affinity 
to a kinase polypeptide selected from the group consisting of SEQ ID NO: 122, SEQ ED 





NO:123, 


SEQ 


ID 


NO:124, 


SEQ 


ID 


NO:125, 


SEQ 


ID 


NO:126, 


SEQ ED NO: 127, 


SEQ 


ID 




NO:128, 


SEQ 


ID 


NO: 129, 


SEQ 


ED 


NO: 130, 


SEQ 


ID 


NO:131, 


SEQ ED NO: 132, 


SEQ 


ID 


5 


NO:133, 


SEQ 


ID 


NO:134, 


SEQ 


ID 


NO: 135, 


SEQ 


ID 


NO: 136, 


SEQ ID NO: 137, 


SEQ 


ID 




NO:138, 


SEQ 


ID 


NO: 139, 


SEQ 


ID 


NO: 140, 


SEQ 


ID 


NO:141, 


SEQ ID NO: 142, 


SEQ 


ID 




NO:143, 


SEQ 


ID 


NO:144, 


SEQ 


ID 


NO: 145, 


SEQ 


ID 


NO: 146, 


SEQ ID NO: 147, 


SEQ 


ID 




NO:148, 


SEQ 


ID 


NO:149, 


SEQ 


ID 


NO: 150, 


SEQ 


ID 


NO:151, 


SEQ ID NO: 152, 


SEQ 


ID 




NO:153, 


SEQ 


ID 


NO: 154, 


SEQ 


ID 


NO: 155, 


SEQ 


ID 


NO:156, 


SEQ ID NO: 157, 


SEQ 


ID 


10 


NO:158, 


SEQ 


ID 


NO: 159, 


SEQ 


ID 


NO: 160, 


SEQ 


ID 


NO:161, 


SEQ ID NO: 162, 


SEQ 


ID 




NO: 163, 


SEQ 


ID 


NO: 164, 


SEQ 


ID 


NO: 165. 


SEQ 


ID 


NO: 166, 


SEQ ID NO: 167, 


SEQ 


ID 




NO:168, 


SEQ 


ID 


NO: 169, 


SEQ 


ID 


NO: 170, 


SEQ 


ID 


NO:171, 


SEQ ED NO: 172, 


SEQ 


ID 




NO:173, 


SEQ 


ID 


NO:174, 


SEQ 


ID 


NO: 175, 


SEQ 


ID 


NO: 176, 


SEQ ED NO: 177, 


SEQ 


ID 




NO:178, 


SEQ 


ID 


NO: 179, 


SEQ 


ID 


NO: 180, 


SEQ 


ED 


NO:181, 


SEQ ED NO: 182, 


SEQ 


ID 


15 


NO:183, 


SEQ 


ID 


NO: 184, 


SEQ 


ID 


NO: 185, 


SEQ 


ED 


NO:186, 


SEQ ED NO: 187, 


SEQ 


ID 




NO: 188, 


SEQ 


ID 


NO:189, 


SEQ 


ID 


NO: 190, 


SEQ 


ID 


NO:191, 


SEQ ED NO: 199, 


SEQ 


ID 




NO: 193, 


SEQ 


ID 


NO: 194, 


SEQ 


ID 


NO: 195, 


SEQ 


ID 


NO: 196, 


SEQ ID NO: 197, 


SEQ 


ID 




NO: 198, 


SEQ 


ID 


NO: 199, 


SEQ 


ID 


NO:200, 


SEQ 


ID 


NO:201, 


SEQ ID NO:202, 


SEQ 


ID 




NO:203, 


SEQ 


ID 


NO:204, 


SEQ 


ID 


NO:205, 


SEQ 


ID 


NO:206, 


SEQ ID NO:207, 


SEQ 


ID 


20 


NO:208, 


SEQ 


ID 


NO:209, 


SEQ 


ID 


NO:210, 


SEQ 


ID 


NO:211, 


SEQIDNO:212, 


SEQ 


ID 




NO:213, 


SEQ 


ID 


NO:214, 


SEQ 


ID 


NO:215, 


SEQ 


ID 


NO:216, 


SEQ IDNO:217, 


SEQ 


ID 




NO:218, 


SEQ 


ID 


NO:219, 


SEQ 


ID 


NO:220, 


SEQ 


ID 


NO:221, 


SEQ ID NO:222, 


SEQ 


ID 




NO:223, 


SEQ 


ID 


NO:224, 


SEQ 


ID 


NO:225, 


SEQ 


ID 


NO:226, 


SEQEDNO:227, 


SEQ 


ID 




NO:228, 


SEQ 


ID 


NO:229, 


SEQ 


ID 


NO:230, 


SEQ 


ED 


NO:231, 


SEQ ID N0.232, 


SEQ 


ID 


25 


NO:233, 


SEQ 


ID 


NO:234, 


SEQ 


ID 


NO:235, 


SEQ 


ED 


NO:236, 


SEQ ID NO:237, 


SEQ 


ID 




NO:238, 


SEQ 


ID 


NO:239, 


SEQ 


ID 


NO:240, 


SEQ 


ED 


NO:241, 


and SEQ ED NO:242. 





29. A method for identifying a substance that modulates kinase activity 



comprising: 

(a) contacting a kinase polypeptide selected from the group consisting 
30 SEQ ED NO:122, SEQ ED NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ED NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO:131, 
SEQ ED NO: 132, SEQ ED NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, 
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SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, 
SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, 
SEQ ID NO: 1 47, SEQ ED NO: 148, SEQ ID NO: 149, SEQ ID NO: 1 50, SEQ DD NO: 1 5 1 , 
SEQ ID NO: 1 52, SEQ DD NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, 
5 SEQ ID NO: 157, SEQ ID NO: 158, SEQ DD NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, 
SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65 . SEQ ID NO: 1 66, 
SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO:171, 
SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, 
SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 181, 

10 SEQ ED NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, 
SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ DD NO: 1 91 , 
SEQ DD NO: 199, SEQ DD NO: 193, SEQ DD NO: 194, SEQ DD NO: 195, SEQ ID NO: 196, 
SEQ DD NO: 197, SEQ ED NO: 198, SEQ DD NO: 199, SEQ DD NO:200, SEQ DD NO:201, 
SEQ DD NO:202, SEQ ED NO:203, SEQ DD NO:204, SEQ DD NO:205, SEQ DD NO:206, 

15 SEQ DD NO:207, SEQ DD NO:208, SEQ DD NO:209, SEQ DD NO:2 1 0, SEQ DD NO:2 1 1 , 
SEQ DD NO:212, SEQ DDNO:213, SEQ DD NO:214, SEQ DD NO:215, SEQ ID NO:216, 
SEQ DD NO:21 7, SEQ DD NO:21 8, SEQ DD NO:21 9, SEQ DD NO:220, SEQ DD NO:22 1 , 
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ DD NO:226, 
SEQ DD NO:227, SEQ DD NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, 

20 SEQ DD NO:232, SEQ DD NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, 

SEQ DD NO:237, SEQ DD NO:238, SEQ DD NO:239, SEQ DD NO:240, SEQ ID NO:24 1 , 
and SEQ DD NO:242 with a test substance; 

(b) measuring the activity of said polypeptide; and 

(c) determining whether said substance modulates the activity of said 

25 polypeptide. 

30. A method for identifying a substance that modulates kinase activity in a 
cell comprising: 

(a) expressing a kinase polypeptide in a cell, wherein said polypeptide 
is selected from the group consisting of SEQ DD NO:122, SEQ DD NO:123, SEQ ID 
30 NO: 124, SEQ DD NO: 125, SEQ DD NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID 
NO: 129, SEQ DD NO: 130, SEQ ID NO: 131, SEQ DD NO: 132, SEQ DD NO: 133, SEQ ID 
NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID 
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NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ED NO: 142, SEQ ID NO: 143, SEQ ID 
NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID 
NO: 1 49, SEQ ID NO:l 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID 
NO: 1 54, SEQ ID NO: 1 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID 
5 NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID 

NO: 164, SEQ ID NO:165. SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID 
NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID 
NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID 
NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID 

10 NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID 

NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID 
NO: 1 94, SEQ ID NO:l 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ED NO: 1 98, SEQ ID 
NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ED 
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ED NO:207, SEQ ID NO:208, SEQ ID 

15 NO:209, SEQ DDNO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ED NO:213, SEQ ID 

NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ED NO:218, SEQ ID 
NO:219, SEQ ED NO:220, SEQ ID NO:22 1 , SEQ ID NO:222, SEQ ID NO:223, SEQ ID 
NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID 
NO:229, SEQ ED NO:230, SEQ ED NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID 

20 NO:234, SEQ DD NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ID NO:240, SEQ ED NO:241, and SEQ DD NO:242; 

(b) adding a test substance to said cell; and 

(c) monitoring a change in cell phenotype or the interaction between 
said polypeptide and a natural binding partner. 
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31. A method for treating a disease or disorder by administering to a patient in 
need of such treatment a substance that modulates the activity of a kinase selected from 
the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID 
NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID 

5 NO: 130, SEQ ED NO:131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID 
NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ED NO: 139, SEQ ID 
NO: 140, SEQ ED NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID 
NO:145, SEQ ED NO:146, SEQ ED NO:147, SEQ ED NO:148, SEQ ED NO:149, SEQ ID 
NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID 

10 NO: 155, SEQ ED NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ED NO: 159, SEQ ID 
NO: 160, SEQ ID NO: 161, SEQ ED NO: 162, SEQ ED NO: 163, SEQ ID NO: 164, SEQ ID 
NO: 1 65. SEQ ED NO: 1 66, SEQ ED NO: 1 67, SEQ ED NO: 168, SEQ ED NO: 1 69, SEQ ID 
NO: 170, SEQ ED NO: 171, SEQ ED NO: 172, SEQ ED NO: 173, SEQ ED NO: 174, SEQ ID 
NO: 1 75, SEQ ED NO: 1 76, SEQ ED NO: 1 77, SEQ ED NO: 1 78, SEQ ED NO: 1 79, SEQ ED 

15 NO: 180, SEQ ED NO: 181, SEQ ED NO: 182, SEQ ED NO: 183, SEQ ED NO: 184, SEQ ID 
NO: 1 85, SEQ ED NO: 1 86, SEQ ED NO: 1 87, SEQ ID NO: 1 88, SEQ ED NO: 1 89, SEQ ID 
NO: 190, SEQ ID NO: 191, SEQ ID NO: 199, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID 
NO: 1 95, SEQ ID NO: 1 96, SEQ ED NO: 1 97, SEQ ID NO: 1 98, SEQ ED NO: 1 99, SEQ ID 
NO:200, SEQ ED NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID 

20 NO:205, SEQ ED NO:206, SEQ ID NO:207, SEQ ED NO:208, SEQ ID NO:209, SEQ ID 
NO:210, SEQ ED NO:21 1, SEQ ID NO:212, SEQ ED NO:213, SEQ ID NO:214, SEQ ID 
NO:215, SEQ EDNO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID N0:219, SEQ ID 
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID 
NO:225, SEQ ED NO:226, SEQ ED NO:227, SEQ ED NO:228, SEQ ID NO:229, SEQ ID 

25 NO:230, SEQ ED NO:231, SEQ ED NO:232, SEQ ED NO:233, SEQ ED NO:234, SEQ ID 
NO:235, SEQ ED NO:236, SEQ ED NO:237, SEQ ED NO:238, SEQ ED NO:239, SEQ ID 
NO:240, SEQ ED NO:241, and SEQ ED NO:242. 

32. The method of claim 31 , wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 

30 neurodegenerative disorders, and cancer. 

33. The method of claim 31, wherein said substance modulates kinase activity 

in vitro. 
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34. The method of claim 33, wherein said substance is a kinase inhibitor. 

35. A method for detection of a kinase polypeptide in a sample as a diagnostic 
tool for a disease or disorder, wherein said method comprises: 

(a) contacting said sample with a nucleic acid probe which hybridizes 
5 under hybridization assay conditions to a nucleic acid target region of a kinase polypeptide 

selected from the group consisting of SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, 
SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, 
SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, 
SEQ ID NO: 1 35, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, 

10 SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 

SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, 
SEQ IDNO:150, SEQ IDNO:151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ED NO: 154, 
SEQ ID NO:l 55, SEQ ID NO: 1 56, SEQ ID NO: 1 57, SEQ ID NO: 158, SEQ ED NO: 1 59, 
SEQ ID NO:l 60, SEQ ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ED NO: 1 64, 

15 SEQ ID NO: 1 65. SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ED NO: 1 69, 

SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ED NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, 
SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ED NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, 
SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, 
SEQ ID N0:185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, 

20 SEQ ID NO: 1 90, SEQ ID NO: 1 91 , SEQ ID NO: 1 99, SEQ ID NO: 193, SEQ ID NO: 1 94, 

SEQ ID NO:l 95, SEQ ID NO: 1 96, SEQ ID NO: 1 97, SEQ ID NO: 1 98, SEQ ID NO: 1 99, 
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, 
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, 
SEQ ID NO:2 1 0, SEQ ID NO:2 1 1 , SEQ ID NO:2 1 2, SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, 

25 SEQ ID NO:215, SEQ IDNO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, 

SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID N0.224, 
SEQ ID NO:225, SEQ ED NO:226, SEQ ID NO:227, SEQ ID N0.228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231, SEQ DO NO:232, SEQ ID NO:233, SEQ ID NO:234, 
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, 

30 SEQ ID NO:240, SEQ ED NO:241, and SEQ ID NO:242, said probe comprising the 

nucleic acid sequence encoding said polypeptide, fragments thereof, or the complements 
of said sequences and fragments; and 
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(b) detecting the presence or amount of the probe :target region hybrid 

as an indication of said disease. 

36. The method of claim 35, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 

5 neurodegenerative disorders, and cancer. 

37. A method for detection of a kinase polypeptide in a sample as a diagnostic 
tool for a disease or disorder, wherein said method comprises: 

(a) comparing a nucleic acid target region encoding said kinase 
polypeptide in a sample, wherein said kinase polypeptide is selected from the group 

0 consisting of SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 125, SEQ 
ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ 
ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO:l 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, SEQ 
ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 40, SEQ 
ID NO: 1 4 1 , SEQ ID NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45 , SEQ 

.5 ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ 
ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO:l 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ 
ID NO: 156, SEQ ID NO: 157, SEQ ID NO:158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ 
ID NO: 1 61 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65. SEQ 
ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ 

10 ID NO: 171, SEQ ID NO: 172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO: 175, SEQ 

ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ 
ID NO: 1 81 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO:l 84, SEQ ID NO: 1 85, SEQ 
ID NO: 186, SEQ ED NO: 187, SEQ ED NO: 188, SEQ ED NO: 189, SEQ ID NO: 190, SEQ 
ED NO: 191, SEQ ED NO: 199, SEQ ED NO: 193, SEQ ED NO: 194, SEQ ID NO: 195, SEQ 

15 ID NO: 1 96, SEQ ED NO: 1 97, SEQ ED NO: 1 98, SEQ ED NO: 1 99, SEQ ID NO:200, SEQ 
ID NO:201, SEQ ED NO:202, SEQ ED NO:203, SEQ ED NO:204, SEQ ED NO:205, SEQ 
ID NO:206, SEQ ED NO:207, SEQ ED NO:208, SEQ ED NO:209, SEQ ED NO:210, SEQ 
ID NO:21 1 , SEQ ED NO:212, SEQ ED NO:213, SEQ ED NO:214, SEQ ED NO:21 5, SEQ 
ID NO:216, SEQ ED NO:217, SEQ ED NO:218, SEQ ED NO:219, SEQ ID NO:220, SEQ 

30 ED NO:221 , SEQ ED NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ 
ID NO:226, SEQ ED NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ 
ID NO:231, SEQ ED NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ 
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ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ED NO:239, SEQ ID NO:240, SEQ 
ID NO:241 , and SEQ ID NO:242, or one or more fragments thereof, with a control nucleic 
acid target region encoding said kinase polypeptide, or one or more fragments thereof; and 
(b) detecting differences in sequence or amount between said target 
5 region and said control target region, as an indication of said disease or disorder. 

38. The method of claim 37, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 
neurodegenerative disorders, and cancer. 
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FIGURE 1A 

SEO ID NO: 122_X69117_H BARK2_H 

MADLEAVLADVS YLMAMEKSKATPAARASKRI VLPEPS I RSVMQKYLAERNE I TFDKI FN 
OKIGFLLFKDFCLNEINEAVPQVKFYEEIKEYEKLDNEEDRLCRSRQIYDAYIMKELLSC 
SHPFSKOAVEHVQSHLSKKQVTSTLFQPYIEEICESLRGDIFQKFMESDKFTRFCQWKNV 

ELNI HLTMNEFSVHR 1 1 GRGGFGEVYGCRKADTGKMYAMKCLDKKRI KMKQGETLALNER 
IMLSLVSTGDCPFIVCMTYAFHTPDKLCFILDLMNGGDLHYHLSQHGVFSEKEMRFYATE 
IILGLEHVHNRFVVYRDLKPANILLDEHGHARISDLGLACDFSKKKPHASVGTHGY^4APE 
VLOKGTAYDSSADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTVNVELPDTFSPE 
LKSLLEGLLQRDVSKRLGCHGGGSQEVKEHSFFKGVDWQHVYLQKYPPPLIPPRGEVNAA 
DAFDIGSFDEEDTKGIKLLDCDQELYKNFPLVISERWQQEVTETVYEAVNADTDKIEARK 
RAKNKOLGHEEDYALGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQNL 
LTMEQILSVEETQIKDKKCILFRIKGGKQFVLQCESDPEFVQWKKELNETFKEAQRLLRR 

apkflnkprsgtvelpkpslchrnsngl 

seo id no • 123 aa144 574_m bark2_m 

cfwyrdlkpanilIdeyghvrisdlglacdfskkkphasvgthgymapevlqkgtcyds 

SADWFSLGCMLFKLLRGH S P FRQH KT KDKHE I DRMTLTVNVQLPDAFS PELR SLLEGLLQ 

rdvsorlgcggggarelkehiffkgidwqhvylrkyppplipprgevnaadafdigsfde 

EDTKGI KLLDCDQDLYKNFPLVI SERWQQEWETI YDAVNADTDKI EARKKAKNKQLGQE 

edyamgkdcimhgymlklgnpfltqwqrryfylfpnrlewrgegesrqslltmeqimsve 

ETQ I KDRKC I LLR I KGGKQFVLQCESDPE FAQWLKELTCTFNEAQRLLRRAP KFLNKPRA 
AILEFSKPPLCHRNSSGL 

^EO ID NO: 124 AA826850 H 

MGSSMSAATARRPVFDDKEDVNFDHFQILRAIGKGSFGKVCIVQKRDTEKMYAMKYMNKQ 

oSIrdeWfreleilqeiehvflvnlwysfqdeedmfmvvdlllggdlryhlqqnvq 

FSEDTVRLY I CEMALALDYLRGQH 1 1 HRDVKPDNI LLDERGHAHLTDFN I AT 1 1 KDGERA 
TAT ART KPYMAPE I FX S F VNGGTG Y S FEVDWW S VGVMAYELLRGWR P YD I H S SNAVE S LV 
OLFSTVSVQYVPTWSKEMVALLRKLLTVNPEHRLSSLQDVQAAPALAGVLWDHLSEKRVE 
PGFVPNKGRLHCDPTFELEEMILESRPLHKKKKRLAKNKSRDNSRDSSQSENDYLQDCLD 
AIQQDFVIFNREKLKRSQDLPREPLPAPESRDAAEPVEDEAERSALPMCGPICPSAGSG 

SEO ID NO: 125 AA960957_H 

MGGNHSHKPPVFDENEEVNFDHFQILRAIGKGSFGKVCIVQKRDTKKMYAMKYMNKQKCI 
ERDEVR^FRELQIMQGLEHPFLVNLWYSFQDEEDMFMVVDLLLGGDLRYHLQQNVHFTE 

AGTKPYMAPEVFQVYMDRGPGYSYPVDWWSLGITAYELLRGWRPYEIHSVTPIDEILNMF 
KVERWYSSTWCKGMVALLRKLLTKDPESRVSSLHDIQSVPYLADMNWDAVFKI^ 
VPNKGRLNCDPTFELEEMILESKPLHKKKKRLAKNRSRDGTKDSCPLNGHLQHCLETVRE 
EFI I FNREKLRRQQGQGSQLLDTDSRGGGQAQSKLQDGCNNNLLTHTCTRGCSS 

Sts^^smiSStLvfrgrhkktgdlfai KVFNNI sflrpvdvqmrefevlk 

S^VKLFAIEEETTTRHKVLIMEFCPCGSLYTVLEEPSNAYGLPESEFLIVLRDW 

g^reSivhrdikpgnimrvigedgqsvykltdfgaareleddeqfvslygte^ 

SpDmERAVLRKDHQKKYGATVDLWSIGVTFYHAATGSLPFRPFEGP 

kpsSiSvq^ngpidwsgdmpvscslsrglqvlltpvlanile^ 
S^JhrSfslqqmtahkiyihsyntatifhelvykqtkiissnqeliyegrrlv 

lepgrSZpktteenpifwsreplntigliyekislpkvhprydldgdas^kaitg 
wcyacriastlllyqelmrkgirwlielikddynetvhk^ 
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FIGURE IB 

GGLRNVDCL 

SEQ ID NO- 127 - AA305176 -" ra vo t .qTl!EFSIVKPISRGAFGKVYLGQKGGKLYR 
mStAGSKKEPGGG^TEEG^RIAWKPPSIEEFSIV ^ LWEYLIG GDVKS 



SASSQSHTFI 



SSVESECHSSPKWEKDCQV 



PQPDDETDTSYFEARJWAQHLTVSGFSL 

"?™lthnppsdfsfq N mns^t«k^RO^ ve , ISEKAKDLI i,r 

c sepds 

CPPQDHDMQGLAERI SVL 

SEQ ID NO: 1 3 j!mQNrT SDPyT KITNLSQKTKVI KKTLTPTWNETFFVHFPEKTTLEL 
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FIGURE 1C 



SEQ ID NO: 132_AA316804_H _ SARLSNGSFSAPS LTNSRGSVHTV 
MSANNSPPSAQKSVLPTAIPA^^^ 

SFLWIGLTRESVTIEAQELSLSAy^LVCSIVYQK pTFCDyC GEMLWGLVR 
ILQLITSADEIHEGDLVE^ 

QGLKCEGCGUWHKRCAFKIP^CSGW^RLS^ YCKRLL KGLFRQGM 

hvhqepskripswsgrpiwme^crv^phtfaw 

Q CKDCKPNC = ^ 




SNPHCFE 1 1 TDTMVYFVGENNGDS ^^^^^^q j p^gYLGS^FGI^GGKHRKTGR 
CTSPGQGKDHKDLSTSISVSNCQIQE^ISWQ^^^^^^^^ G ^ 
UlKVIDKMRFPTKQESQL^EVAILQNLHHPG 

FGF ^onSp^™^^ 
S^etrSt™ 

SEQ ID NO: 133 - P ^^ T " . IOKELK lKEGVENLRRVATDRRHLGHVQQLLRSS 

meegaprqpgpsqwppedekevirraiqkelkikeg EQLRARH lealrrqlhvelkv 

NRRLEQLHGELRELHARILLPG^GPAEPVA^^ 
KQGAENMTHTCASOT^ 

EELQHRLHVEAAVAEGAKNW^LSSRRTQDR^ 

===== 

RGPSPPASPTRKPPRWDFRCWVLGRGHFGKVLLVQFK DLMMQIHEDV FPEPQ 
IESLVCEKRILEAVGCTGHPFLLSLLVCFQTSSHRRt CKEG IGFGDRTSTFC 

SEQ ID NO: 134_AI021023_M ^!T*="„ TpEFLA p E vLTQEAYTRAVDWWG 
HSLLTARQQAAFRDFDFVSERFLEP 

QFim,FLPEFPIRPIR<^^^?^^^S? DIjyS I,« S AVGCFPEASI 
SAETHPFDDFDCDLESFLLYPIPA 
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FIGURE ID 

seo id no: 136 aa476563 h 

meffridskdsaseLlgldfgekLyslkseplkpfftlpdgdsasrsfntseskvefkaq 

DTI SRGSDDSVPVI SFKDAAFDDVSGTDEGRPDLLVNLPGELESTREAAAMGPTKFTQTN 
IGIIENKLLEAPDVLCLRLSTEQCQAHEEKGIEELSDPSGPKSYSITEKHYAQEDPRMLF 
J^SsGDMSLLPSSDPKFQGLGVVESAVTANNTEESLFRICSPLSGANEYIASTDT 
LKTEEVLLFTDQTDDLAKEEPTSLFQRDSETKGESGLVLEGDKEIHQIFEDLDKKLALAS 

rfyIpegciqrwaae^aldai^regivcrdlnpnnilimdrghiqltyfsrwseveds 

CDSDAIERMYCAPEVGAITEETEACDWWSLGAVLFELLTGKTLVECHPAGINTHTTLNMP 
ECVSEEARSLIQQLLQFNPLERLGAGVAGVEDIKSHPFFTPVDWAELMR 

qFO ID NO- 137 AA626690 H 

MLPFAPQDEPWDREMEVFSGGGAisGEVNGLKMVDEPMEEGEADSCHDEGWKEIPITHH 
VKEGYEKADPAQFELLKVLGQGSFGKVFLVRKKTGPDAGQLYAMKVLKKASLKVRDRVRT 

S?^vn5pfivklhyafqtegklylildflrggdvftrlskevlfteedvkfyla 
SSXhlZgivyrdlkpenilldeighikltdfglskesvdqekkaysfcgtveym 

Apt^RGHSQSADWWSYGVLMFEMLTGTLPFQGKDRNETMIWILKAKLGMPQFLSA^ 

osllr^fkrnpanrlgsegveeikrhlffanidwdklykrevqppfkpasgkpddtfcf 

Sp^totLglpasm^qlfkgfsfvatsiaeeykitpitsi^pivqin^ 
qfgevyelkedigvgsysvckrcihattnmefavkiidkskrdpseeieilmrygqhpni 
?™vfddgrywlvtdlmkggelldrilkqkcfsereasdilyvisktvdylhcqgvv 
Srdlkpsnilymdesasadsiricdfgfakqlrgengllltpcytanfvapevlmqogyd 
aacdiwslgvlfytmlagytpfangpndtpeeillrigngkfslsggnwdnisdgakdll 

tHMLHMDPHQRYTAEQILKHSWITHRDQLPNDQPKRNDVSHWKGAMVATYSALTHKTFQ 

pvlepvaasslaqrrsmkkrtstgl 
seo id no: 138 aa215680 h 

MSLVACECLPSPGLiPEPCSRARiQAHVYLEQIRNRVALGVPDMTKRDYLVDAATQIRLA 

lIrdvsedyeaafitoyqngvdvllrgihvdpnkerreavklkitkylrraeeifnchlqr 

plssgaIpsagfsslr^pirtlssaveqlrgcrwgviekvqlvqdpatggt 

rch^sIeLtiiphgvpymtkllryfvsedsiflhlehvqggtlwshllsqahsrhsgl 

ssg^qermkaqlnphi^lltparlpsghapgqdrialepprtspnlllageapstrpqr 

IaISptartst^gssdlpkapgghlhlqarragqns 

rgmdqsclsadgagrgcgratwsvreeqvkqwaaemlvalealheqgvlcrdlhpgnlll 
doaghirltyfgqwsevepqccgeavdnlysapevggiselteacdwwsfgsllyelltg 

tiqwsklvg 

mtvkteaa^gtltysrmr^ 

sqpqepe^^a^pspppspsqqinlgpssnphakpsdfhflkvigkgsfgkvllarh^e 
evfyavkvlqkkailkkkeekhimseri^llknvkhpflvglhfsfqtadklyfvldyin 

ggeSySqrercfleprarfyaaeiasalgyl^ 
^glcSnieSsttstfcgtpeyiap 

ddlinkkitppfnpnvsgpnelrhfdpefteepvpnsigkspdsvlvtasvkeaaeaflg 
fsyapptdsfl 

MTVKAEAA^STLTYS^ 

shpSp^anpspppspsqqinlgpssnphakpsdfhflkvigkgsfgkvliarhkae 
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FIGURE IE 

SYAPPVDSFL 

GVEPEDTTSTFCGTPEYLAPEVLRKEPYDKAVUW u ^ppsp^DDLYHK 
ILDC 

SEQ ID NO: ^2 ^887783 -» T ^ LVSVGRSEWF VFRRYAEFDKLYNT 
VNASVLEADDAFVGFSYAPPSEDLFL 

SEQ ID NO: 144 - H602 ^-" _ rcrT c rNNA KRAGPFILGPRLGNSPVPSIVQCLAR 
MSKLRMKRRASDRGAGETSARAKALGSGISGNNAKRAGPt DGW HHHGLFQD 

KDGTDDFYQLKI LTLEERGDQG ^ ^^^^pgp^^LINLQHYVI KEKRLSERETV 
RTCEIVEDTESSRMV^^ 

VI FYDWRWEALHQKNI VHRDLKLGNMVLNKRTHRITITN ^EYTI PE 

spayispdvlsgrpyrgkpsdmwalgwlf™ 

DAQPMTSLDTAI LAQRYLRK 

WKKLCTLDGKOVRVTCVHLPDFFGDDDV^CGPEKFRY Q mGNRCSE£SThV EK 
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FIGURE IF 

„„„»™™ -SKSSSSESSSHSSSS^ 

GRRQVWPDCGAGLEVFELGSRELPSHGSWCLP 

SEQ ID NO: 146_W30246_M |GK324 M pEGVNGNRCS ESFPLLEKYR 
TKS S S S S PT S PG S FRGLKI p &LKI^[ DKAKCCGKEHL I ENEV S I LRRVKH PN 1 1 ML 

SEQ ID NO: 147_AA3 83293 H SAVQAPLAV RALYTPCHGHPV 
PAAKRVVVYRNGDPFFPGSQLVVTQRRFPTMEAFLCE^ 

KLLTEKVKLQSGAWLCTLEGLPLSAG^^^^ 

GNEAHLRSGVGTVAGSPKPLGRKAKKETCLIVTLTLKy^ FFGDDDVFIACGPEK FRYA 




EPKTRPEENKPERPSGRKPRPMGIIAA^^ 
MKIIDKSRLKGKEDMVDSEILII^LS^ 

lESVKFPEPDAALMIMDLC^^DKSr^RDL^E LCGFppFRSPXXGDQDE 
KRQKQVS P S SDGHFRSQHKRWEQVS 

SEQ ID NO: 148 - AA19788 ^ DDTDrHRG p C DHSLKCLSSKISERKLPGPWLPAGR 
MPTAPVLRPPPPPATPAPPAPSRPAPP^GHRGPCDH^ 

GPLEKPVLG PRGAVMPLF S p^^^pY^^^Qpgy ^S VSDFFREGDAF I AMGKE PLTLK 
VQTFEQLLSDI SEALGFPRWKNDRVRKLFTLKGREVKb SYSAEMESKA VS 

Sqi^eelypknralalaphsrv™ 

IVE^KFPEPEAAVMITDLC^F^MHDralVHRD™ VIIjYILLCGF ppF R SPE R DQDE 

nsqkeespnslghfqsqhkkvaeqmp 

SEQ ID NO: 14 9_DRAK2 H ILTSKELGRG KFAVVRQCISKSTGQEYA 

msrrrfdcrsisglltttpqipi^enfnwfyil^ 

AKFLKKRRRGQDCRAEILHEIAVLELAKSCP^ 

l^lpelaemvsendvirlikqileg^ylhqnnivh IIAYMLLT htspfvgedn 
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FIGURE 1G 

.PHPHHTSSSSOTQOHSVHSSeOKTSKSSCNCTCOOREOKBKXPBOSSMVSKKrRPDOSL 
PNPHELVSDLLC 

SEQ ID NO: 1 50 - W ^-"°^fYTLTPKELGRGKFAWRQCISKSTGQEYA 
MSRRRFDCRSVSGLLTTTPQTPIKTE^FYTLTP^ NATEIILV LEYAAGGEIFN 
AKSLKKRRRGQDCRAE I I AV^ELA^SCP^IN^H i LLS S I Y PLGD I KI VDF 

LCLPELAEMVS ENDV I RLI KQ I ^?Y?^^p j^^jpj^viN I G 1 1 A YMLLTHT S P FVGEDN 
GMSRKIGNASELREIMGTPEY^PEIL^DPITTATO 

PSPHELVPDLFC 

SEQ ID NO: 151_H012 4 J_»' °^-^ pppppQARGLL TEIRAVVRTEPFQDGYS 
M I PLEKPGSGGS S ^^"^^ ^^^?^™pKRRKGQDCRMEI I HE I AVLELAQDNPW 

LCPGRELGRGKFAWRKCI ^^^^p^py^^gg^pKEKDVQRLMRQI LEGVHFLHTR 
VINLHEVYETASEMILVLEYA^GGE^DQCVADR^ 

DVVHLDLKPQNILLTSESPLGDIKI^ 
^IGY^ 

MLCHPHIIRLYQVMETERMIYLVTEYAS^ 

CHCRNI VHRDLKAENLLLDANLNIKI ^^^^ RVL ^ GKFRIPF FMSTECEHLIRHML 
YDGPKVDIWSLGWLYVLVCGALPFB^STLQN^ 

VLDPNKI^SMEQICKHK^ 

LEKTQQQHMLYQQEQHHQILQQQIQDSICPW^ 

SPPPNHPNNHLFRQPSNSPPPMSSAMIQPHGAAS^ 

VALTCLGMQQPAQSQQVTIQVQEP^SNMPGT^ 

SEQ ID NO: 153_2R22-5-ll H TESSKEGEEGQ pRQLTPFEKLTQDMSQDEK 
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FIGURE 1H 

RVYRGIRHTSKFCSIL 

SE Q ID NO: 154_R31237_1 H, ^33487 SCADEQPH IGNYRLLK 

"ksseldasdsssssnls^pps™ 

Sgpaipsotaypkrsqtstmjgdlkedgissrkssgs GKENSTIPDQ rtpvas 

^PERKKSSTVPSSNTASGGMTR^WCSERTTADRH HEATpLSQTRSR G 
THSISSAATPDRIRFPRGTASRSTFHG^RERRTAT MDpGDMMRE IRKVLD 

SKIANELKL 

^VCTERPGSERPSLLPNGK^SSG^ 



PERI 



SEQ ID NO: 156 406786 . ^— ^ aaoTT aEPSRSFSSAHRHLSRRNGLSRLCQS 

mISggltvfeedqrclsqslplpvsae^ 

RTALSEDRWSSYCLSSI^QNICTSKLHCPAAPEHTD QKLTQFF LRSDSDWE 

PLLPAPVCNPNKAI P^VDAKTTE ^ TRSG^^ WMKRMRQERRLCCVWLEPVERVST 
ALSEEHMEADGHAAWFGTVVDIITRSGEKIPVSV LP p SGQH IPKNLKIQRSV 

LSPQPAPGVDNVPEGSLPVHGEQALPKDQQITALGKt DLWAGAA VAKPQAKGQ 
SSlQ^VEI^PTPLFCCTL^LWSQ^SAART FGFWrAVDKG 
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FIGURE 11 

IVlAEDFTIKLIDFGSAAYLERG^TrcGTIEYCAPEVL LEKLVTDpwvxQ 
HPGDPRLLTS 

SEQ ID NO: 157_AA544 838 M 40678* >^ IVIAEDFTIK LIDFGSAAYL 

TRPHPCLDEPIASFIFRQLVSAVGYLHS^ 

ERGKLFYTFCGTIEYCAPEVLIGNPYRG^^^ 

GLLSAASLEIGSRSPSEMA^ 

PYAAPEVFEGQQYEGPQLDIWSMGVyLYVWGALPtu QEQENEPSIGEF NEQV 

sed^hlirrmlvldpskrltiaqikeh^^ 
SlmLlgidqqkxieslqnksynhfaaiyfllver^^ 

aeqtvakaqtvglpvtmhspn^llrsa^ 

pSclldpvppvlwkgcqs^^ 

RRHTLSEVTNQLVVMPGAGKIFS^ 

ANQPSPRMTSPFISLRPTOPAMQALSSQ^EVH^P ^gscPQEEVSQQQESVST 
HLQNLARTKGILELNKVQLLYEQIGPE^ 

LPASVHPQLSPRQSLETQYLQHRLQKPSLLSKAQN1CU FSLTQPLS p V LEP 

rlfLkqWyfnqmqiaess^ 
selpglfdcemldavdpqhngyvlvn 

SEQ ID NO : 159 -^°7220 H KQAVKRHH HKHNLRHRYEFLETLG 
MESLVFM*SGPTPSAAE^P^ 

kgtygkvkkaressgrlvaiksir^ki^^hi avhychqnrvv „ RDL kle N 

N S S KI V I VMEY ASRGDL YD Y I p^y PE I VNGKP Y TGPE VD S W S LGVLL 

ILLDANGNIKIADFGLSNLYHQGKFLQTFCGSPL^ 

yiLVHGTMPFDGHDHKILVKQISNGAYREPP^ 

HWVmOWGYATRVGEQEAPHEGGHPGSDSARASMADW ^p KGI . LK KKVSASA 

gggsttpglerqhslkksrkend^^ 
lgdscfsltdcqevtatyrqalrvcsklt 

SEQ ID NO: 160_AA426580_H, FLPAW VSGVPRERLRDFQHHKRVGN 

mpaaagdgllgepaapgggggaedaarp^ 

y^igsrklgegsfakvreglhvltge^ai^id^kkd^ RRYIRQLI savehlhra 

pnitqlldiletensyylvmelcpggn^ 

g™rdlkienllldednnikli^ 

V H!) 
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FIGURE 1J 

DVINTVLSNRACHILAIYFLLNKKLERYLSGKSDIQDSLCYKTRLYQIEKYRAPKESYEA 
SLDTWTRDLEFHAVQDKKPKEQEKRGDFLHRPFSKKLDKNLPSHKQPSGSLMTQIQNTKA 
LLKDRKASKSSFPDKDSFGCRNIFRKTSDSNCVASSSMEFIPVPPPRTPRIVKKPEPHQP 
GPGSTGIPHKEDPLMLDMVRSFESVDRDDHVEVLSPSHHYRILNSPVSLARRNSSERTLS 
PGLPSGSMSPLHTPLHPTLVSFAHEDKNSPPKEEGLCCPPPVPSNGPMQPLGSPNCVKSR 
GRFPMMGIGQMLRKRHQSLQPSADRPLEASLPPLQPLAPVNLAFDMADGVKTQC 

SEQ ID NO: 161_Z36720_H 

MDTKLNMLNEKVDQLLHFQEDVTEKLQSMCRDMGHLERGLHRLEASRAPGPGGADGVPHI 

DTQAGWPEVLELVRAMQQDAAQHGARLEALFRMVAAVDRAIALVGATFQKSKVADFLMQG 

RVPWRRGSPGDSPEEWVKEEEVCFMPPVPPAPGAAGQSLQKDKGELSAEQGIWATLMTLV 

I MVTAANKERVEEEGGKPKHVLSTSGVQSDAREPGEESQKADVLEGTAERLPP I RASGLG 

ADPAQAWSPGQGDGVPGPAQAFPGHLPLPTKVEAKAPETPSENLRTGLELAPAPGRVNV 

VSPSLEVAPGAGQGASSSRPDPEPLEEGTRLTPGPGPQCPGPPGLPAQARATHSGGETPP 

RAALLKGAVAPGFSRRDLVFPSIFCACLGISIHIQEMDTPGEMLMTGRGSLGPTLTTEAP 

AAAQPGKQGPPGTGRCLQAPGTEPGEQTPEGARELSPLQESSSPGGVKAEEEQRAGAEPG 

TRPSLARSDDNDHEVGALGLQQGKSPGAGNPEPEQDCAARAPVRAEAVRRMPPGAEAGSV 

VLDDSPAPPAPFEHRWSVKETS I SAGYEVCQHEVLGGGRFGQVHRCTEKSTGLPLAAKI 

I KVKS AKDREDVKNE INI MNQLSHVNLI QLYDAFES KHS CTLVME YVDGGELFDR I TDEK 

YHLTELD WLFTRQ I CEGVHYLHQHY I LHLDLKPEN I LC VNQTGHQ I KI I DFGLARRYKP 

REKLKVNFGTPEFLAPEWNYEFVSFPTDMWSVGVITYMLLSGLSPFLGETDAETMNFIV 

NCSWDFDADTFEGLSEEAKDFVSRLLVKEKSCRMSATQCLKHEWLNNLPAKASRSKTRLK 

SQLLLQKYIAQRKWKKHFYWTAANRLRKFPTSP 
SEQ ID NO: 162_SGK088_H 

GEMALFECLVAGPTDVEVDWLCRGRLLQPALLKCKMHFDGRKCKLLLTSVHEDDSGVYTC 
KLSTAKDELTCSARLTVRPSLAPLFTRLLEDVEVLEGRAARFDCKI SGTPPPWTWTHFG 
CPMEESENLRLRQDGGLHSLHIAHVGSEDEGLYAVSAVNTHGQAHCSAQLYVEEPRTAAS 
GPSSKLEKMPSIPEEPEQGELERLSIPDFLRPLQDLEVGLAKEAMLECQVTGLPYPTISW 
FHNGHRIQSSDDRRMTQYRDVHRLVFPAVGPQHAGVYKSVIANKLGKAACYAHLYVTDW 
PGPPDGAPQWAVTGRMVTLTWNPPRSLDMAIDPDSLTYTVQHQVLGSDQWTALVTGLRE 
PGWAATGLRKGVQHIFRVLSTTVKSSSKPSPPSEPVQLLEHGPTLEEAPAMLDKPDIVYV 
VEGQPASVTVTFNHVEAQWWRSCRGALLEARAGVYELSQPDDDQYCLRICRVSRRDMGA 
LTCTARNRHGTQTCSVTLELAEAPRFESIMEDVEVGAGETARFAVWEGKPLPDIMWYKD 
EVLLTESSHVSFVYEENECSLWLSTGAQDGGVYTCTAQNLAGEVSCKAELAVHSAQTAM 
EVEGVGEDEDHRGRRLSDFYDIHQEIGRGAFSYLRRIVERSSGLEFAAKFIPSQAKPKAS 
ARREARLLARLQHDCVLYFHEAFERRRGLVIVTELCTEELLERIARKPTVCESEIRAYMR 
QVLEGIHYLHQSHVLHLDVKPENLLVWDGAAGEQQVRICDFGNAQELTPGEPQYCQYGTP 

EFVAPE I VNQ S PVS G VTD I W P VGWAFLCLTG ISP F VGENDRTTLMN I RNYNVAFEETTF 
LSLSREARGFLIKVLVQDRLRPTAEETLEHPWFKTQAKGAEVSTDHLKLFLSRRRWQRSQ 
ISYKCHLVLRPIPELLRAPPERVWVTMPRRPPPSGGLSSSSDSEEEELEELPSVPRPLQP 
EFSGSRVSLTDIPTEDEALGTPETGAATPMDWQEQGRAPSQDQEAPSPEALPSPGQEPAA 
GASPRRGELRRGSSAESALPRAGPRELGRGLHKAASVELPQRRSPGPGATRLARGGLGEG 
EYAQRLQALRQRLLRGGPEDGKVSGLRGPLLESLGGRARDPRMARAASSEAAPHHQPPLE 
NRGLQKSSSFSQGEAEPRGRHRRAGAPLEIPVARLGARRLQESPSLSALSEAQPSSPARP 
SAPKPSTPKSAEPSATTPSDAPQPPAPQPAQDKAPEPRPEPVRASKPAPPPQALQTLALP 
LTPYAQIIQSLQLSGHAQGPSQGPAAPPSEPKPHAAVFARVASPPPGAPEKRVPSAGGPP 
VLAEKARVPTVPPRPGSSLSSSIENLESEAVFEAKFKRSRESPLSLGLRLLSRSRSEERG 
PFRGAEEEDGIYRPSPAGTPLELVRRPERSRSVQDLRAVGEPGLVRRLSLSLSQRLRRTP 
PAQRHPAWEARGGDGESSEGGSSARGSPVLAMRRRLSFTLERLSSRLQRSGSSEDSGGAS 
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FIGURE IK 

GRSTPLFGRLRRATSEGESLRRLGLPHNQLAAQAGATTPSAESLGSEASATSGSSAPGES 
RSRLRWGFSRPRKDKGLSPPNLSASVQEELGHQYVRSESDFPPVFHIKLKDQVLLEGEAA 

TLLCLPAACPAPHI SWMKDKKSLRSEPSVI I VSCKDGRQLLSI PRAGKRHAGLYECSATN 
VLGS I TS SCTVAVARVPGKLAPPEVTQTYQDTALVLWKPGDSRAPCTYTLERRVDGESVW 

HPVS SG I PDC YYNVTHLPVGVTVRFRVACANRAGQG PF SN S SEKVF VRGTQDS S AVPSAA 
HOEAPVTSRPARARPPDSPTSLAPPLAPAAPTPPSVTVSPSSPPTPPSQALSSLKAVGPP 
POTP PRRHRGLQAARPAEPTLPSTHVTPSE PKPFVLDTGTP I PASTPQGVKPVS S STPVY 
WTSFVSAPPAPEPPAPEPPPEPTKVTVQSLSPAKEWSSPGSSPRSSPRPEGTTLRQGP 
POKPYTFLEEKARGRFGWRACRENATGRTFVAKIVPYAAEGKPRVLQEYEVLRTLHHER 
IMSLHEAYITPRYLVLIAESCGNRELLCGLSDRFRYSEDDVATYMVQLLQGLDYLHGHHV 
LHLDIKPDNLLLAPDNALKIVDFGSAQPYNPQALRPLGHRTGTLEFMAPEMVKGEPIGSA 
TDIWGAGVLTYIMLSGRSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLSV 
HPWSRPSLQDCLAHPWLQDAYLMKLRRQTLTFTTNRLKEFLGEQRRRRAEAATRHKVLLR 

SYPGGP 

SEQ ID NO: 163_AA542015_M SGK088_M 

ATDIWGAGVLTYIMLSGYSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLS 
VHPWSRPSLQDCLAHPWLQDAYLMKLRRQTLTFTTNRLKEFLGEQRRRRAEAATRHKVLL 

RSYPGSP 

SEQ ID NO: 164_R19772_H 

MKGGDRAYTRGPSLGWLFAKCCCCFPCRDAYSHSSSENGGKSESVANLQAQPSLNFIHSS 
PGPKRSTNTLKKWLTSPVRRLNSGKADGNIKKQKKVRDGRKSFDLGSPKPGDETTPQGDS 
ADESKKGWGEDEPDEESHTPLPPPMKIFDNDPTQDEMSSSLLAARQASTEVPTAADLVNA 
IEKLVKNKLSLEGSSYRGSLKDPAGCLNEGMAPPTPPKNPEEEQKAKALRGRMFVLNELV 
OTEKDYVKDLGIWEGFMKRIEEKGVPEDMRGKDKIVFGNIHQIYDWHKDFFLAELEKCI 
OEODRLAQLFIKHERKLHIYVWYCQNKPRSEYIVAEYDAYFEEVKQEINQRLTLSDFLIK 

P I OR I TKYQLLLKDFLRY S EKAGLEC SD I E KAVELMCLVPKRCNDMMNLGRLQGFEGTLT 
AOGKLLQQDTFYVIELDAGMQSRTKERRVFLFEQIVIFSELLRKGSLTPGYMFKRSIKMN 
YLVLEENVDNDPCKFALMNRETSERWLQAANADIQQAWVQDINQVLETQRDFLNALQSP 
IEYQRKERSTAVMRSQPARLPQASPRPYSSVPAGSEKPPKGSSYNPPLPPLKISTSNGSP 
GFEYHQPGDKFEASKNDLGGCNGTSSMAVIKDYYALKENEICVSQGEWQVLAVNQQNMC 

LVYQPASDHS PAAEGWVPGS I LAPLTKATAAES SDGS I KKSCS WHTLRMRKRAEVENTGK 
NEATGPRKPKDILGNKVSVKETNSSEESECDDLDPNTSMEILNPNFIQEVAPEFLVPLVD 
VTCLLGDTVILQCKVCGRPKPTITWKGPDQNILDTDNSSATYTVSSCDSGEITLKICNLM 
PODSGIYTCIATNDHGTTSTSATVKVQGVPAAPNRPIAQERSCTSVILRWLPPSSTGNCT 
ISGYTVEYREEGSQIWQQSVASTLDTYLVIEDLSPGCPYQFRVSASNPWGISLPSEPSEF 
VRLPEYDAAADGATISWKENFDSAYTELNEIGRGRFSIVKKCIHKATRKDVAVKFVNKKM 
KKKEQAAHEAALLQHLQHPQYITLHDTYESPTSYILILELMDDGRLLDYLMNHDELMEEK 

VAFY I RDI MEALQYLHNCRVAHLDI KPENLLI DLRI PVPRVKLIDLEDAVQI SGHFHIHH 
LLGNPEFAAPEVIQGIPVSLGTDIWSIGVLTYVMLSGVSPFLDESKEETCINVCRVDFSF 
PHEYFCGVSNAARDFINVILQEDFRRRPTAATCLQHPWLQPHNGSYSKIPLDTSRLACFI 

ERRKHQNDVRP I PNVKSY I VNRVNQGT 
SEO ID NO: 165_5R72_8_2_H 

MAD SGLDKKS TKC PDC S S ASQKD VLCVCS S KTR VP PVL WEM S QT S S I G S AE S L I S LE R K 
KEKNINRDITSRKDLPSRTSNVERKASQQQWGRGNFTEGKVPHIRIENGAAIEEIYTFGR 
ILGKGSFGIVIEATDKETETKWAIKKVNKEKAGSSAVKLLEREVNILKSVKHEHI IHLEQ 
VFFTPKKMYLVMELCEDGELKEILDRKGHFSENETRWIIQSLASAIAYLHNNDIVHRDLK 
LEN I MVKS S L I DDNNE I NLN I KVTDFGLAVKKQSRS EAMLQATCGTP I YMAPEVI S AHD Y 
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FIGURE 1L 

SOOCDIWSIGVVMYMLLRGEPPFLASSEAKLFELIRKGELHFENAVWNSISDCAKSVLiKQ 
S^PA^ITAKELLDNQWLTG^ 

Sk^qpwgnvpetnytsdeeeekqstayekqfpatskdnfdmcsssftsskllpae 

I KGEMEKTPVTPSQGTATKYPAKSGALSRTKKKL 
cpn in NO- 166 SGK3 09 H 

moci^^etnmsgggeqaBilpanywkdrwkvlkkiggggfgeiyeai^lltre^ 
S^Sogrni^lrrsqprgtftlsttlrlgkqilesieaihsvgflhrdikpsnf 

m WSLFYMLVEFAVGQLPWRKI KDKEQVGMI KEKYEHRMLLKHMPSEFHLFLDHI ASLDY 

gocdasawgpLLhrgcatgrapexpgectpnsarealxgagpqsppcppprgsxg^ 

ggdrcqpeq?pdqhrqsncrqge 
pvpspalaslclpsssssvsftlrrpsa 



SdhSml^pIfs^hissldyftkpdyqlltsvfdnsiktfgviesdpf 

rMnGSLTTCTTSTTPQLHTRLTPAAIGIANATPI PGDLLRENTDEVFPDEQLSDGENGI P 

vgSspdSpgslgSprpqe™^ 
?f?iffem?wd!eS 

Sish^geIkSfgi^raksipsqtyssevvtlwyrppdallgateysse 
fiemfqgqplfpgvsnileqlekiwevlg^ 

™r[grSpe!edlasqmlkgfprdrvsaqea L vhdyfsalpsqlyqlpdeeslftvsgv 
rlkpemcdllasyqkghhpaqfskcw 

ay^Itrwyrapellvgdtqygssvdiwaigcvfaelltgqplwpgk 
^gkSprhqsifksngffhgi s I pepedmetleekfsdvhpvalnfmkgclkmnpddrl 

TCSQLLESSYFDSFQFAQIKRKARNEGRN 
FDHLPNI 

KT ALREI RMLKLKHPNLVNLI EVFRRKRKMHLVFEYCDHTLLNELERNPNGVSDGVI KSV 

^pelSSgIvdwavgcvfaelltgqplwpgksdvdqlyliirtlg^ 
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FIGURE 1M 

DliKAKQTEKEKSQEVSSKSGSMKDRISGSSIWSNEETODYGKAQ^KSSSl^SRSSK^ 

KGSPVFLPRKENSSVEAKDSGLESKKLPRSVI^EK&APD^ pTIASP ppp LP 
SOPPWSVKTQVSVTAAIPHLKTSTLPPLPLP^ 

gwh^Urpieesllnpelirppipeg 
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FIGURE IN 

SEQ ID NO: 175_AA631990_H FSAEGpFAFAEMR HSKRTHCPDW 
RI TLDEALQHPFFDLLKKK 

PEHSPSWPSSRLRLSPQEFGDHPNIISLLDVIP^EN^ 

gVsPSQAHLHKPRADPQLPSRTPVQGPRPRPQ^S^ 
EGHHV 

SEQ ID NO: 177_N28606_H MOK H QVNNLREIQALR RLNPH 
MKNYKAIGKIGEGTFSEVMKMQS^^ 

PNILMLHEWFDRKSGS^LICELMD^IYEL^RG^RRY g-j-RWYRAPECLLTDGFYT 

RNG I FHRDVKPEN I L I ^^Y^^^P^f j gjQ HDVI GTPAQKI LTKFKQSRAMNFDF P 
YKMDLWSAGCVFYEIASLQPLFPGWELDQISKI^ 

SEQ ID NO: 1 78 J*°""? ? ?' el^aIKKMKRKFYSWEECMNQREVKSLKKLNHA 
MNRYTTIRQLGDGTYGSVLLGRSIESGELIAIKKMKK^ MY QILQGLAFIHKLG 

EKPQKGILERAGPPPYIKPVPPAQPPAKPHTR^SRQHQASQPP^H^^^^^^^^^ 




qprSpgSprppaaqpvhgrtdmaskypspr 

SEQ ID NO: l 7 9_ AA839940 - M _ Dri „ Dr ,„ TVr , TT DLQQGIDPGAVSPEPGKDHAAQ 
IsNNGGMSAEEEIGPGAEP^GPS^TRDWRDE^TTDLM QHEVLGQGRF 

GPGRTEAGRVSSAAEAAIVVI^DSA^PPAPFE^R^SI^^^^^^^^^g^gp^ 
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FIGURE 10 

OTGHQI KI IDFGLARRYKPREKLKVNFGTPEFLAPEWNYEFVSFPTDMWSVGVITYMLL 
SGLSPFLGETDAETMNFIVNCSWDFDADTFKGLSEEAKDFVSRLLVKEKSCRMSATQCLK 
HEWLNHLPAKASGSNVRLRSQQLLQKYMAQSKWKKHFHWAAVNRLRKFPTCP 

SEO ID NO: 180 AA460132_H 

MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK 

HRF P KG YRH PALEARLGRRRTVQEARALLRCRRAG I SAP WFFVDYASNCLYMEEI EGSV 
TVRDYIQSTMETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQLNIV 
LIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD 

EVRLRGRKRSMVG 

SEO ID NO: 181 SGK034_H 

OREKVNOGNMPGLQSTFLAMDTEEGVEVVWNELHFGDRKAFAAHEEKIQTVFEQLVLVDH 
PNIVKLHKYWLDTSEACARVIFITEYVSSGSLKQFLKKTKKNHKAMNARAWKRWCTQILS 
ALSFLHACSPPIIHGNLTSDTIFIQHNGLIKIGSVWHRIFSNALPDDLRSPIRAEREELR 

NT PFFPPEYGEVADGTAVDI FSFGMCALEMAVLEIQTNGDTRVTEEAI ARARHSLSDPNM 
REFILCCLARDPARRPSAHSLLFHRVLFEVHSLKLLAAHCFIQHQYLMPENVVEEKTKAM 
DLHAVLAELPRPRRPPLQWRYSEVSFMELDKFLEDVRNGIYPLMNFAATRPLGLPRVLAP 
PPEEVQKAKTPTPEPFDSETRKVIQMQCNLERSEDKARWHLTLLLVLEDRLHRQLTYDLL 

PTD S AQDLAS ELVH YG FLHEDDRMKLAAFLE S TFLKYRGTQA 
SEO ID NO: 182 AA103218_M SGK034_M 

HASAPEYGEVNDGTGFVDIFSFGMCALEMAVLEIQANGDTRVTEEAIARARHSLSDPNMR 
EFILSCLARDPARRPSAHNLLFHRVLFEVHSLKLLAAHCFIQHQYLMPENWEEKTKAMD 
LHAVLAEMPQPHGPPMQWRYSEVSFLELDKFLEDVRNGIYPLMNFAAARPLGLPRVLAPP 
PEEAQKAKTPTPEPFDSETRKWQMQCNLERSEDKARWHLTLLLVLEDRLHRQLTYDLLP 

TDSAQDLAAELVHYGFLHEDDRTKLAAFLETTFLKYRGTQA 
SEO ID NO: 183 NEK7 H, N34132_H 

MSGGAAEKQSSTPGSLFLSPPAPAPKNGSSSDSSVGEKLGAAAADAVTGRTEEYRRRRHT 

MDKDSRGAAATTTTTEHRFFRRS VI CDSNATALELPGLPLSLPQPS I PAAVPQSAPPEPH 
REETVTATATSQVAQQPPAAAAPGEQAVAGPAPSTVPSSTSKDRPVSQPSLVGSKEEPPP 
ARSGSGGGSAKEPQEERSQQQDDIEELETKAVGMSNDGRFLKFDIEIGRGSFKTVYKGLD 
TETTVE VAWCELQDRKLTKSERQRF KEEAEMLKGLQH PN I VRFYDS WE S TVKGKKC I VLV 
TELMTSGTLKTYLKRFKVMKIKVLRSWCRQILKGLQFLHTRTPLI IHRDLKCDNI FITGP 
TGSVKIGDLGLATLKRASFAKSVIGTPEFMAPEMYEEKYDESVDVYAFGMCMLEMATSEY 

PYSECONAAQI YRRVTSGVKPAS FDKVAI PEVKEI IEGCI RQNKDERYS I KDLLNHAFFQ 
EETGVRVELAEEDDGEKIAIKLWLRIEDIKKLKGKYKDNEAIEFCFDLERDVPEDVAQEM 

VES GYVCEGDHKTMAKAI KDRVS LI KRKREQRQLVREEQENKKQEES SLKQQVEQ S S ASQ 
TGTKOLPSASTGIPTASTTSASVSTQVEPEEPEADQHQQLQYQQPSISVLSDGTVDSGQG 
SSVFTESRVSSQQTVSYGFPXHEQAHSTGTVPGHIPSTVQAQSQPHGVYPPSSVQQGIQQ 
TAPPOOTVQYSLSQTSTSSEATTAQPVSQPQAPQVLPQVSAGKQSTQGVSQVAPAEPVAV 
AOPOATOPTTLASSVDSAHSDVASGMSDGNENVPSSSGRHEGRTTKRHYRKSVRSRSRHE 
KTSRPKLRILNVSNKGDRVVECQLETHNRKMVTFKFDLDGDNPEEIATIMVNNDFILAIE 
RESFVDOVREIIEKADEMLSEDVSVEPEGDQGLESLQGKDDYGFSGSQKLEGEFKQPIPA 

S SMPOO I G I PTS S LTQ WH SAGRRF I VS P VPE S RLRE S KVFPS E I TDTVAA S TAQ S PGMN 
LSHSASSLSLQQAFSELRRAQMTEGPNTAPPNFSHTGPTFPWPPFLSSIAGVPTTAAAT 

APVPATSSPPNDISTSVIQSEVTVPTEEGIAGVATSTGWTSGGLPIPPVSESPVLSSVV 

qqiTIPAWSISTTSPSLQVPTSTSEIWSSTALYPSVTVSATSASAGGSTATPGPKPPA 

VVSQQAAGSTTVGATLTSVSTTTSFPSTASQLSIQLSSSTSTPTLAETVVVSAHSLDKTS 
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FIGURE IP 

HSSTTGLAFSLSAPSSSSSPGAGVSSYISQPGGLHPLVIPSVIASTPILPQAAGPTSTPL 
LPOVPSIPPLVQPVANVPAVQQTLIHSQPQPALLPNQPHTHCPEVDSDTQPKAPGIDDIK 
TLEEKLRSLFSEHSSSGAQHASVSLETSLVIESTVTPGIPTTAVAPSKLLTSTTSTCLPP 
TNLPLGTVALPVTPWTPGQVSTPVSTTTSGVKPGTAPSKPPLTKAPVLPVGTELPAGTL 
PSEQLPPFPGPSLTQSQQPLEDLDAQLRRTLSPEMITVTSAVGPVSMAAPTAITEAGTQP 
OKGVSOVKEGPVLATSSGAGVFKMGRFQVSVAADGAQKEGKNKSEDAKSVHFESSTSESS 
VLSSSSPESTLVKPEPNGITIPGISSDVPESAHKTTASEAKSDTGQPTKVGRFQVTTTAN 
KVGRFSVSKTEDKITDTKKEGPVASPPFMDLEQAVLPAVIPKKEKPELSEPSHLNGPSSD 
PEAAFLSRDVDDGSGSPHSPHQLSSKSLPSQNLSQSLSNSFNSSYMSSDNESDIEDEDLK 
LELRRLRDKHLKEIQDLQSRQKHEI ESLYTKLGKVPPAVI I PPAAPLSGRRRRPTKSKGS 

kSrssslgnkspqlsgnlsgqsaasvlhpqqtlhppgnipesgqnqllqplkpspssdn 

LYSAFTSDGAI S VP S LS APGQGNKAT I IVQKQ 
SEO ID NO- 184 BCON3 H 

msegesqtvlssgsBpkvessssapgltsvsppvtsttsaaspeeeeesedeseileesp 
cgrwqkrreevnqrnvpgidsayi^dteegvevvwnevqfserknyklqeekvravfdn 
liolehlnivkfhkywadikenkarvifiteymssgslkqflkktkknhktmnekawkrw 

SSLSA^SYLLcDPPIIHGNLTCDTIFIQHNGLIKIGSVAPDTINimVKTCREEQ^ 
HFFAPEYGEVTNVTTAVDIYSFGMCALEMAVLEIQGNGESSYVPQEAISSAIQLLEDPLQ 

refiokclqseparrptarellfhpalfevpslkllaahcivghqhmipenaleeitknm 

DTSAVLAEIPAGPGREPVQTLYSQSPALELDKFLEDVRNGIYPLTAFGLPRPQQPQQEEV 
TSPWPPSVKTPTPEPAEVETRKWLMQCNIESVEEGVKHHLTLLLKLEDKLNRHLSCDL 
MPNENIPELAAELVQLGFISEADQSRLTSLLEETLNKFNFARNSTLNSAAVTVSS 

SEO ID NO- 185 AA711829_M 

LKQFLKKTKKNHKTMNEKAWKRWCTQILSALSYLHSCDPPI IHGNLTCDTIFIQHNGLIK 
I GS VAPDT I NNHVKTCREEQKNLHFFAPEYGEVTNVTTAVD I Y S FGMCALEMAVLE I QGN 
GESSYVPQEAISSAIQLLEDSLQREFIQKCLQSEPARRPTARELLFHPALFEVPSLKLLA 

^C^HQHMIPENALEEITKNMDTSAVl^EIPAGPGREPVQ^ 

RNGIYPLTAFGLPRPQQPQQEEVTSPWPPSVKTPTPEPAEVETRKVVLMQCNIESVEEG 
VKHHLTLLLKLEDKLNRHLSCDLMPNE S I PDLAAELVQLGF I SEADQSRLTSLLEETLNK 

fnftrnstlntatvtvs s 

seo id no: 186 aa099102 h 

msscvssqpssnraKpqdelggr^ssssesqkpcealrglsslsihlgmesfiwtecep 
gcav^lgSrdrpleadgqevpldtsgsqarphlsgrklslqersqgglaaggsldmngr 
cicpslpyspvsspqssprlprrptveshhvsitgmqdcvqlnqytlkdeigkgsygwk 
laynendntyyamkvlskkklirqaafprrppprgtrpapggciqprgpieqvyqeiail 
kkldhpnwklvevlddpnedhlymvfelvnqgpvmevptlkplsedqarfyfqdlikgi 
eylhyqkiihrdikpsnllvgedghikiadfgvsnefkgsdallsnyvgtpafmapesls 
etrkifsgkakdvwamgvtlycfvfgqcpfmderimclhskiksqalefpdqpdiaedlk 
dlitrmldknpesriwpeiklhpwvtrhgaeplpsedenctlvevteeevensvkhips 
Stvilvktmirkrsfgnpfegsrreerslsapgnlltkkptreceslselkearqrrqp 
pghrpaprggggsalvrgspcvescwapapgsparmhplrpeeamepe 

SFO TD NO- 187 5R69 17 2 H 

moeipoeqikeikkeqlsgspwYllrenevstlykgeyhrapvaikvfkklqagsiaivr 



PCT/US00/14842 

WO 00/73469 

FIGURE 1Q 

L 

SEQ ID NO: 188_H85811 H EpssNWDMTG y G SHSKVYSQSKNIPLS 

EGDYQLVQHEVLCSMTNTYE^EFLGRGTFGQWKC^ 
QIEVSILARLSTESADDYNFVRAYECFQHI^HTC 

IRPVLQQVATALMKLKSLGL I ^^^^pf^HpLY PGDSE YDQ I RY I SQTQGLP 

^WRNTHAHGSHYNPIMQQPALLTGH^P^QP^ 

Q^VRNVSTCEVSSSQAISSPQRS^V^PP^^ SKQRKN VISCVTVHDSPYS 
RERQRQTIVIPDTPSP^SVITISSDTDEEE^QKHAPT ppLKTQASEV LVECDSLV 

dSsntspysvqqraghnnan^ 

^TSHHSSSYKSKSSSNVTSTSGH^ 
^ISASpfsTVYTGYPLSPAKVNQYPYI 

q E 0 ID NO: 18 9_DYRK3_H XTM ^ Atrn pTrnHTOHFLDGGEMKVEQLFQEFGNR 
SIPLCSVLPKLIS 

QEO ID NO: 190 M589241_M D ™^-" v - totdgrw1i lGGRSRRGKKEGPPG 
SE Q ID NO: l^ 1 -^ 72 - 1 ^" nALEAIYG ADFQDLRPDACGPVKEPPEINLVLY 

^fS?hv 0 sflseh N kpppksfhe^p^eme^e^ mspd ^ ongkhr 
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FIGURE 1R 

YLAMNLKEQDDSIVVDILVEHISGVSLAAHLSHSGPIPVHQLRRYTAQLLSGLDYLHSNS 
WHKVLSASNVLVDAEGTVKITDYSISKRLADICKEDVFEQTRVRFSDNALPYKTGKKGD 
VWRLGLLLLSLSQGQECGEYPVTIPSDLPADFQDFLKKCVCLDDKERWSPQQLLKHSFIN 
PQPKMPLVEQSPEDSGGQDYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGA 
FGAVIKVQNKLDGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHENIVRYYNAWIERHE 
RPAGPGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAAPPPILSSSVEWSTSGERSAS 
ARFPATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDI I FDNEDENSKSQNQDEDCNEK 
NGCHESEPS VTTEAVHYLY I QME YCEKSTLRDT I DQGLYRDTVRLWRLFRE I LDGLAY I H 
EKGMIHRDLKPVNI FLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLI KSDPSGHLTG 
MVGTALYVSPEVQGSTKSAYNQKVDLFSLGI I FFEMSYHPMVTASERI FVLNQLRDPTS P 
KFPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSELLPPPQMEESELHEVLHHTLT 
NVDGKAYRTMMAQI FSQRIS PAIDYTYDSDI LKGNFS I RTAKMQQHVCET I IRI FKRHGA 
VQLCTPLLLPRNRQI YEHNEAALFMDHSGMLVMLPFDLRI PFARYVARNNILNLKRYCI E 
RVFRPRKLDRFHPKELLECAFDIVTSTTNSFLPTAEIIYTIYEI IQEFPAL.QERNYSIYL 
NHTMLLKAILLHCGIPEDKLSQVYIILYDAVTEKLTRREVEAKFCNLSLSSNSLCRLYKF 
I EQKGDLQDLMPTI NSLI KQKTG I AQLVKYGLKDLEEWGLLKKLG I KLQVLI NLGLVYK 
VQQHNG 1 1 FQ FVAF I KRRQRAV P E I LAAGGR YDLL I PQFRG PQALG P VPT A I G VS I A I DK 
ISAAVLNMEESVTISSCDLLWSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQ 
EYCRHHEITYVALVSDKEGSHVKVKSFEKERQTEKRVLETELVDHVLQKLRTKVTDERNG 
REASDNLAVQNLKGSFSNASGLFEIHGATWPIVSVLAPEKLSASTRRRYETQVQTRLQT 
SLANLHQKSSEIEILAVDLPKETILQFLSLEWDADEQAFNTTVKQLLSRLPKQRYLKLVC 

DEI YNI KVEKKVS VLFLYS YRDDYYRI LF 
SEQ ID NO: 192_R43524_H ( HRI_H 

MLGGNSGVRKREEEGDGAGAVAAPPAIDFPAEGPDPEYDESDVPAEIQVLKEPLQQPTFP 
FAVANQLLLVSLLEHLSHVHEPNPLRSRQVFKLLCQTFIKMGLLSSFTCSDEFSSLRLHH 
NRAITHLMRSAKERVRQDPCEDISRIQKIRSREVALEAQTSRYLNEFEELVILGKGGYGR 
VYKVRNKLDGQYYAIKKILIKGATKTVCMKVLREVKVLAGLQHPNIVGYHTAWIEHVHVI 
QPRADRAAIELPSLEVLSDQEEDREQCGVKNDESSSSSIIFAEPTPEKEKRFGESDTENQ 
NNKSVKYTTNLVIRESGELESTLELQENGLAGLSASSIVEQQLPLRRNSHLEESFTSTEE 
S SEENVNFLGQTEAQYHLMLHI QMQLCELSLWDW I VERNKRGREYVDESACPYVMANVAT 
KI FQELVEGVFY I HNMGI VHRDLKPRNI FLHGPDQQVKI GDFGLACTDI LQKNTDWTNRN 
G.KRTPTHTSRVGTCLYASPEQLEGSEYDAKSDMYSLGWLLELFQPFGTEMERAEVLTGL 
RTGQLPESLRKRCPVQAKYI QHLTRRNSSQRPSAIQLLQSELFQNSGNVNLTLQMKI I EQ 
EKE I AELKKQLNLLSQDKGVRDDGKDGGVG 

SEQ ID NO: 193_17000057519457_H 

MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK 
HRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGISAPWFFVDYASNCLYMEEIEGSV 

TVRD YIQSTMETEKTPQGLSNLAKT I GQVLARMHDEDL I HGDLTTSNMLLKP PLEQLN I V 
LIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD 

EVRLRGRKRSMVG 

SEQ ID NO: 194_AA013524_M 

LVQQGAEARVFRGRFQGRAAWKHRFPKSYRHPELEARLGRRRTVQEARALLRCRRAGIA 
APWFFVDYASNCLYMEEIEDSVTVRDYIQSTMETEKDPQCLLDLARRMGQVLAGMHDQD 
LIHGDLTTSNMLLRRPLAQLHIVLIDFGLSFVSGLPEDKGVDLYVLEKAFLSTHPHTETA 

FEAFLKSYGASSKKSSPVLKKLDEVRLRGRKRSMVG 
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FIGURE IS 

cun T n NO- 195 17000139801197 H, IRAKM_H 

Sgncgargalsahtllfdlppallgelcavldscdgalg™ 

YVDQGKSGTRELLWSWAQKNKTIGDLLQVLQEMGHRRAIHLI^ 

FPNI LFKETANVTVDNVLI PEHNEKGVLLKSS I SFQNI I EGTRNFHKDFLI GEGE I FEVY 

RvflQN^A^FKQEKKMQCKKHWKRFLSELEVLLLFHHPNILEI^YFTETE 

lddqtop^tdfamahfrshlehqsctinmtsssskhlwympeeyirqgklsiktdvy^ 
g?v^e^gcrwlddpkhiqlrdllrelmekrgldsclsfldkkvppcpp^fsaklfc 
SgrcaSS^RPSMdevlntlestqaslyfaedpptslksfrcpsplflenvpsipve 
Sesqn^Spsdeglridrmtqktpfecsqsevmflsldkkpeskrneeacnmpsssce 



ESWFPK' 



yivpsqdlrpykvnidpsseapghscrsrpvesscsskfswdeyeqykke 



qEO ID NO: 196 AA840598 M IRAKM M mmr , T „„ 

mwkrflselevlllFrhphilelaayfteteklclvypymsngtlfdrlqctngttplsw 

XsvlSiISiqylhntqpcavicgnvssanillddqlqpkltdfa^ 

^Sgggrkhl^mpeeyirqgrlsvktdvysfgivimevltgckvvldd 

dllmelmekrgldsclsfldrkippcprnfsaklfslagrcvatkaklrptmdevlssle 

s^lyfa^dpptslksfrcpsplfldnvpsipveddenqnnhsvpp^^ 
?p?ecsqsevtflgldrnrgnrgseadcnvpsssheec W spelvapsqdlsptvislgss 

wevpghsygskpmekrcssglfcseheqskkq 

qpo ID NO: 197 AA088547 H 

^sawgsrpwprlglqlqfaalIlgtlspqvhtlrpenlllvstldgslhalskqtgdl 
^Sdpviegpmyvtemaflsdpadgslyilgtqkqqglmklpftipelvhaspcrss 

dgwy?grkqdLfvvdpesgetqmtlttegpstprlyigrtq 

^SrYsSXsPGKYMSHIASCGMGLLLTVDPGSGTVLWTQDLGVPVMGWTW 

rolphSlardtlhfialrwghirlpasgprdtatlfstldtqllmtlyvgkdetgfyvs 
Stg^lvprgltlapadgpttdevtlqvsgeregspstavrypsgsvalpsqwlli 

^S^^VHPTLGSGTAETRPPENTOAPAFFLELLSLSREKLWDSELHPEEK 

^pdIySIgpqdli^sltavllggwilfvmrqwekqqetpl^ 
gasrrsokrlqspskqaqplddpeaeqltwgkisfnpkdvlgrgaggtfvfrgqfegra 

?a™l2r^c?glw^vqllqesdrhpnvlryfctergpqfhyialelcraslqeyven 
pSglSe^qqlmsgi^lhslhivhrdl^^ 

kklpagrcsfslhsgipgtegwmapellqllppdsptsavdifsagcvfyyvlsggshpf 

SSrqSiltgapciahleeevhdkwardlvg^ 
akqlqf?q^vsd^ekeseqeplvraleagg 

swdSS^kkhhyrelpvevrqalgqvpdgfvqyftorfprlllhthramrscases 
lflpyyppdsearrpcpgatgr 



SEQ ID NO: 198_HGP_6644466 



►EGISNFKTPSKLsiKKKivLCSTPTINZP^^ 

^^IM^LS^mDEDK^FDESD^DEAVYAALGTRPPMMEELDESyQKVIELFSVC 

tnedpkdrpsaahivealetdv 
sprglshspwavkkT^lcddhyrtvy 

C^MEYGGEKSLNDLIEERNKDSGSPFPAAVILRVALHMARGLKYLHQEKKLLHGD 
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FIGURE IT 

NWI KGDFETI KI CDVGVSLPLDENMTVTDPEACY I GTEPWKPKEALEENGI I TDKADVF 
AFGLTLWEMMTLCI PHVNLPDDDVDEDATFDESDFDDEAYYAALGTRPS INMELDDS YQK 

AIELFCVCTNEDPKDRPSAAHIVEALELDGQCCGLSESKH 

SEQ ID NO: 200_5R57_10_2_M TESK2_M 
LLDSDLYLPWTVRVKLAYGIAVGLSYLHFKGIFHRDLTSKV 

SEQ ID NO: 201_AA2322 53_H 

MSSLGASFVQIKFDDLQFFENCGGGSFGSVYRAKWISQDKEVAVKKLLKIEKEAEILSVL 
SHRNIIQFYGVILEPPNYGIVTEYASLGSLYDYINSNRSEEMDMDHIMTWATDVAKGMHY 
LHMEAPVKVIHRDLKSRNWIAADGVLKICDFGASRFHNHTTHMSLVGTFPW^4APEVIQS 
LPVSETCDTYSYGWLWEMLTREVPFKGLEGLQVAWLWEKNERLTIPSSCPRSFAELLH 
OCWEADAKKR PS FKQI I S I LESMSNDTSL PDKCNS FLHNKAEWRCE I EATLERLKKLERD 
LSFKEQELKERERRLKMWEQKLTEQSNTPLLPSFEI GAWTEDDVYCWVQQLVRKGDS SAE 
MSVYASLFKENNITGKRLLLLEEEDLKDMGIVSKGHIIHFKSAIEKLTHDYINLFHFPPL 
TKDSGGEPEENEEKIVNLELVFGFHLKPGTGPQDCKWKMYMEMDGDEIAITYIKDVTFNT 
NLPDAEILKMTKPPFVMEKWIVGIAKSQTVECTVTYESDVRTPKSTKHVHLIQWSRTKPQ 
DEVKAVQLAIQTLFTNSDGNPGSRSDSSADCQWLDTLRMRQIASNTSLQRSQSNPILGSP 
FFSHFDGQDSYAAAVRRPQVPIKYQQITPVNQSRSSSPTQYGLTKNFSSLHLNSRDSGFS 
SGNTDTSSERGRYSDRSRNKYGRGSISLNSSPRGRYSGKSQHSTPSRGRYPGKFYRVSQS 
ALNPHQSPDFKRSPRDLHQPNTIPGMPLHPETDSRASEEDSKVSEGGWTKVEYRKKPHRP 

S PAKTNKERARGDHRGWRNF 
SEQ ID NO: 202_AI375137_H 



GDGSYVSVPSPLGKIKSMTKEKAUJ.ljJ-.J-.KA^ijfsnrnijVJ-'^cJ-^ 1 

RCRNKIVAIKRYRANTYCSKSDVDMFCREVSILCQLNHPCVIQFVGACLNDPSQFAIVTQ 

YISGGSLFSLLHEQKRILDLQSKLIIAVDVAKGMEYLHNLTQPI IHRDLNSHNILLYEDG 

HAWADFGESRFLQSLDEDNMTKQPGNLRWMAPEVFTQCTRYTIKADVFSYALCLWEILT 

GEIPFAHLKPAAAAADMAYHHIRPPIGYSIPKPI SSLLIRGWNACPEGRPEFSEWMKLE 
ECLCNIELMSPASSNSSGSLSPSSSSDCLVNRGGPGRSHVAALRSRFELEYALNARSYAA 
LSQSAGQYSSQGLSLEEMKRSLQYTPIDKYGYVSDPMSSMHFHSCRNSSSFEDSS 

SEO ID NO: 203 H97685_H 

MESERSPLYRQLIDLGYLSSSHWNCGAPGQDTKAQSMLVEQSEKLRHLSTFSHQVLQTRL 

VDAAKALNL VHCHCLD I F I NQAFDMQRDLQ I TPKRLE YTRKKENELYE S LMN I ANRKQEE 
MKDMIVETLNTMKEELLDDATNMEFKDVIVPENGEPVGTREIKCCIRQIQELI ISRLNQA 
VANKLISSVDYLRESFVGTLERCLQSLEKSQDVSVHITSNYLKQILNAAYHVEVTFHSGS 
SVTRMLWEQIKQIIQRITWVSPPAITLEWKRKVAQEAIESLSASKLAKSICSQFRTRLNS 
qHEAFAASLRQLEAGHSGRLEKTEDLWLRVRKDHAPRLARLSLESRSLQDVLLHRKPKLG 
OELGRGOYGVVYLCDNWGGH F PCALKS WP PDEKH WNDLALE FH YMRS L P KHERLVDLHG 
S VI DYNYGGGS S I AVLL I MERLHRDLYTGLKAGLTLETRLQ I ALDWEG I RFLHSQGLVH 
RDIKLKNVLLDKQNRAKITDLGFCKPEAMMSGSIVGTPIHI^IAPELFTGKYDNSVDVYAFG 



1o 
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FIGURE 1U 

rpIlgivqpmlqgimnrlcksnseqpnrglddst 

f^tapsftkntdlslegdpfsqpikfp^ 
pSpenqtvniqiwpREPCddvk^ 

KFAAAEITEGEAEGWEEEGELNVJEDNNW 

qFO ID NO: 206 AI052250_H t^omt^out acjrCNGLAWKIFNGTKKSTKQE 

WVFVFDKKLIDK^QKFEKDQIIDSLK^QQLTRLROTR LSFLHSSVKM VHGNIT 

YPSMKNALIPRIKNACYKHLPLRFV 

QEO ID NO: 207 AA278842_H „ D ™vaTrSPVSIFVYDVKPGAEEQTQV 
MWFFARDPWDFPFELIPEPPEGGLPG^ 

EANLNVELMKHFARLQAKDEQGPIR^ 

APSRVAGVLGFAATHNLYS^DCAQKILPVLCGLTVU ^ IRSH PTTAPTETNIPQ 

SLemeakraerkvakgpmklgarkld 

1^/ ll} 
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FIGURE IV 



SEQ ID NO: 2 0 8_AA599286 H nSHTEYI I RVQGGI SVENSWQI VRRYSDFD 

MAFMEKPPAGKVLLDDTVPLTAAIEASQSLQS 

^SLQIAGLSLPLPPKKLIG^^ 
PNNYSANYTEIAWQVSMFFRSEP^EWEPL^ 

ADLGPDKYLSDKDFQCLIKLLPSCLHPYIYRV^ 

AKPKDPFLKKYCNPKKIQGLELQQIKTYGRQI^ 

CRLLDLENSLLGLPSFYRSYFSQFRKINTLES^VHCFG pQFK1 p T KLKEALR 

iaSciekrlieeqkqihqhrrltraqshhgseee^ 

KYSNSNNSAGSGASSPLTSPS™ 

MSASTGGGGDSGGSGG^SSS^A 
=== 

SEQ ID NO: 210_SGK022 H SEFIQRFLPRE LQ 
MEDFLLSNGYQLGKTIGEGTYS^KE^ 

IVRTLDHKNIIQVYEMLESADGKICLV^LAEGGD ELSQTFC GSTAYAAPEV 
SIS 

lSrllepdmilrpsieevswhpwlast 

SEQ ID NO: 211_AA060026^ M SGK022 M GGPEEFIQRF LPRELQ 
MEDELLSNGYQLGKTIGEGTY^^^ 

IVRTLDHKNIIQVYEMLESADGOYLV^LAEGGD RELSQTF CGSTAYAAPEV 

lSSllepdmilrpsieevswhpwlast 
csepiagkwfsqltlgiaylhOT 

^R^ 
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FIGURE 1W 

SEQ ID NO: 214_AA883975 H RRRAppDFVNKFLPR E 

MSGDKLLSELGYKLGRTIG^ 

LSILRGVRHPHIVHVFEFIEV^^^ 

^LIAELLQFSPSARPSAGQVARNCWLRAGDSG 

SEQ ID NO: 2 1 5_AA90 54 4 6 JH pRELQIVRTLD HKNIIQVYEMLESADG 

Q sSdl[kIlle P d M lr P sieevs M hpv,«st 

SEO ID NO: 216_H29914 H ApENVELA1 J\ E FWALTSLKRRHQNV 
VSLIAEIGRGSYGVVYEAVMRSGARVAVKKIRCDAPEN EEpcYLWFVME FCEGG 
VQFEECVWRNGIAQRMSHGNKSSQLY^^KGERI^L LITERSGTPIIi KVAD 

^qSSSaheQDRPDAfeletrmdqvtcaa 

SEQ ID NO: 2^-^ 4981 °l-" " p "R S ivQRVLCEKLRPAAQA«DPAGAEVPGEA 
PlLpPPPW^^^^SSS^EMAGRSGARVAVKKIRCDAPEBVE 
FI^RRPDGGGGDVPARPRVSLIAEIGRGSTCWYEAVA LVETSLKGE RII, 

DLEPTLKVADFGLSKVCSASGQNPEEPVSVNKC^ 

^kodiSSakp Q drpdafe L elrlv Q iaekd S swet 

^fp^ggeS^ 
Spqnilesslekphdkladfgfaq™ 

s^s— — — " 

a3/ ||} 
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FIGURE IX 

THYEVDAORKEAIKAKVGQYVSRAEELKAIVSSSNQALLRQGTSARDLLREMARDKPRLL 
AALEVASAAMAKEEAAGGEQDALDLYOHSLGELLLLLRSPRAGGGSCFTLRFRTSWPELN 



qEO ID NO: 220 AA311714_H 

MENF I LYEE I GRGSKTWYKGRRKGT I NFVAI LCTDKCRR PE I TNWVRLTRE I KHKN I VT 
FHEWYETSNHLWLVXENLPEDVWEFGIDLISGLHHLHKLGILFCDISPRKILLEGPGTL 
KFSNFCLAKVEGENLEEFFALVAAEEGGGDNGENVLKKSMKSRVKGSPVYTAPEWRGAD 
FSISSDLWSLGCLLYEMFSGKPPFFSESVSELTEKILCEDPLPPIPKDSSRPKASSDFIN 
LLDGLLORDPQKRLTWTRLLQHSFWKKAFAGADQESSVEDLSLSRNTMECSGPQDSKELL 
ONSOSRQAKGHKSGQPLGHSFRLENPTEFRPKSTLEGQLNESMFLLSSRPTPRTSTAVEV 

VKFDAKILHLPTYSVDKLLFLKDQDWNDFLQQVCSQIDSTEKSMGASRAKLNLLCYLCW 

A^HQEVATRLLHSPLF^^ 
SIGIGILNCLVQHSTPVPRQCLVYV 

SEQ ID NO: 221_SGK384_H 

S LAHVLRARQ I LTEPE VRD YLRGLVS GLR YLHQRC I LHR 
SEO ID NO- 222 AA210451 M SGK384_M 

MGQQHGTRNGLTHRELPRGVGLLLAMALMNVALYLCLDQLFISPGRSTADSRRCP^ 
MGRMRNCSRWLSCEELRTEVRQLKRVGEGAVKRVFLSEWKEHKVALSRLTRLEMKEDFLH 

gSSslqsehwtlvgyceedgtilteyhplgslsnleetlnlskyqdvntwqhrlql 

A^EY^SIIN^LH^kGTRVMCDSNDLPKTLSQYLLTSNFSIVANDLDALPLVDHDSGVL 
^COTRE^DFVAPEQLWPYGEDTPFQDDLMPSYNEKVDIWKIPDVSSFLLGHVEGSDM 
VRFHLFDIHKACKSQIPAERPTAQNVLDAYQRVFHSLRDTVMSQTKEML 

qFO ID NO- 223 SGK071 2 H 

EWAVQ^ECMDDHYASQALiElMPLLKLRHAHISVYQELFITWNGEISSLYLCLVMEF 

NELS FQEVI EDKRKAKKI I DSEWMQNVLGQVLDALEYLHHLDI I HRNLKPSNI I LI S SDH 
CKXQDLSSNVLMTDKAKWNIRAEEDPFRKSW^4APEALNFSFSQKSDIWSLGCI ILDMTSC 
SF^ID^TE^^LRKSLRQSPGSLKAVLKTMEEKQIPDVETFRNLLPLMLQIDPSDRITIKD 

V^nTFLRGSFKSSCVSLTLHRQMVPAS I TDMLLEGNVASI LGDAGDTKGERALKLLSMA 
I^SYCLVPEGSLFMPLALLHMHDQWLSCDQDRVPGKRDFASLGKLGKLLGPIPKGLPWPP 

elSevvvttmelhdrvldvqlcacslllhllgqalvhhpeakapcnqaitstllsalqsh 
peSpTSsllaitttqeseslseelqnagllehilehlnsslesrdvcasglgllw 

IlllddpIl^qrprkkrapnhgkpgkpknpastqsiivnkapl^ 
dge^eascgvfwllsllgcikeqqfeqwalllqsirlcqdrallviwayrglaslvkv 

sel^fWqeeggsglsliketyqlhrddpewenvgmllvhlasyeeilpelvsssm 
kallqeikerftsslvsdssafskpglppggspqlgcttsggle 



qFO ID NO- 224 AA118352 M SGK071_M 

ISpcqkswmapeaIkfsfstksBiwslgciildmatcsflndteamqlrkairhhpgs^ 
kp?lk?me1kqipgtdw^ 

Spif^dvllegnmanilgswlcasfvndsrhcdsgigsqrlgfdfqsv^ 
WnfssLevqlrainklltmpedqlglpwptelleevisiikqhgrildil^ 
^vlgoalakdpeaei prssli I sflmdtlrshpnserlvnwynvlai issqgqi seel 
eeegSq^qSlIfqedrdiclsilsl 

ed^iIe^cavlwllsllgcikesqfeqvwlllrsiqlcpgrvllvnnafrglaslak 



-w I/) 
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FIGURE 1Y 

VSELVAFR I WLEEGS SGLHL IQDI YKLYKDDPEWENLCMLLAHLTS YKE I LPEMESGG 
I KDLVQVI RGRFTS SLEL I SYADE I LQVLEANAQPGLQEDQLEPPAGQEAPLQGEPLFRP 

SEQ ID NO: 225_018653 . 9_H 

GRGRGAGHARGLGRGPAGRRAEPPRSLSRPGPGPGSRAGPAGRGEGSDAAPAGGSGRGFL 
RLLPAGLRPQRALRSGSEPPRPGQSPEPSPAPGAGRRGGRGELARQIRARYEEVQRYSRG 
GPGPGAGRPERRRLMDLAPGGPGLPRPRPPWARPLSDGAPGWPPAPGPGSPGPGPRLGCA 
ALRNVSGAQYMGSGYTKAVYRVRLPGGAAVALKAVDFSGHDLGSCVREFGVRRGCYRLAA 
HKLLKEMVLLERLRHPNVLQLYGYCYQDSEDIPDTLTTITELGAPVEMIQLLQTSWEDRF 
RICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVDGELKVTDLDDARVEETPCAGSTDCI 
LEFPARNFTLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDS I VNATGE 

LAWG VDETLAQLEKVLHL YR S GQYLQN S TAS S S TE Y QC I PDSTI PQEDYRCWPS YHHGSC 
LLSVFNLAEAVDVCESHAQCRAFWTNQTTWTGRQLVFFKTGWSQWPDPNKTTYVKASG 

SEQ ID NO: 226_AA396601_M 

TRPGCAALRNVSGAQYVGSGYTKAVYRVRLPGGAAVALKAVDFSGHDLGSCVREFGARRG 
CYRLAAHKLLKEMVLLERLRHPNVLQLYGYCYQDSEGIPDTLTTITELGAPVEMIQLLQT 
SWEDRFRICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVNGELKVTDLDDARVEETPCT 
SSADCTLEFPARNFSLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSI 
VNATGELAWGVDETLAQLETALHLFRSGQYLQNSTSSRAEYQRIPDSAITQEDYRCWPSY 
HHGGCLLSVFNLAEAIDVCESHAQCRAFWTNQTTWTGRKLVFFKTGWNQWPDAGKTTY 

VKAPG 

SEQ ID NO: 227_VRK3_H 

MISFCPDCGKSIQAAFKFCPYCGNSLPVEEHVGSQTFVNPHVSSFQGSKRGLNSSFETSP 
KKVKWSSTVTSPRLSLFSDGDSSESEDTLSSSERSKGSGSRPPTPKSSPQKTRKSPQVTR 
GSPQKTSCSPQKTRQSPQTLKRSRVTTSLEALPTGTVLTDKSGRQWKLKSFQTRDNQGIL 
YEAAPTSTLTCDSGPQKQKFSLKLDAKDGRLFNEQNFFQRAAKPLQVNKWKKLYSTPLLA 
IPTCMGFGVHQDKYRFLVLPSLGRSLQSALDVSPKHVLSERSVLQVACRLLDALEFLHEN 
EYVHGNVTAENIFVDPEDQSQVTLAGYGFAFRYCPSGKHVAYVEGSRSPHEGDLEFISMD 
LHKGCGPSRRSDLQSLGYCMLKWLYGFLPWTNCLPNTEDIMKQKQKFVDKPGPFVGPCGH 
W I rps ETLQKYLKWMALTYEEKPPYAMLRNNLEALLQDLRVS PYDP I GLPMV P 

SEQ ID NO: 22 8_S71575_M VRK3_M 

IPTCIGFGIHQDKYRFLVFPSLGRSLQSALDDNPKHWSERCVLQVACRLLDALEYLHEN 
EYVHGNLTAENVFVNPEDLSQVTLVGYGFTYRYCPGGKHVAYKEGSRSPHDGDLEFISMD 
LHKGCGPSRRSDLQTLGYCMLKWLYGSLPWTNCLPNTEKITRQKQKYLDSPERLVGLCGR 
WNKASETLREYLKWMALNYEEKPPYATLRNSLEALLQDMRVSPYDPLDLQMVP 

SEQ ID NO: 229 AA45427_H 

MGHALCVCSRGTVIIDNKRYLFIQKLGEGGFSYVDLVEGLHDGHFYALKRILCHEQQDRE 
EAQREADMHRLFNHPNILRLVAYCLRERGAKHEAWLLLPFFKRGTLWNEIERLKDKGNFL 
TEDOILWLLLGICRGLEAIHAKGYAHRDLKPTNILLGDEGQPVLMDLGSMNQACIHVEGS 
ROALTLQDWAAQRCTISYRAPELFSVQSHCVIDERTDVWSLGCVLYAMMFGEGPYDMVFQ 
KGDSVALAVQNQLS I PQSPRHSSALRQLLNSMMTVDPHQRPHI PLLLSQLEALQPPAPGQ 
HTTQI 

SEO ID NO: 230 H05721 H 

MAVRQALGRGLQLGRALLLRFTGKPGRAYGLGRPGPAAGCVRGERPGWAAGPGAEPRRVG 
^LPNRLRFFRQSVAGLAARLQRQFWRAWGCAGPCGRAVFLAFGLGLGLIEEKQAESRR 
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AVSACQEIQAIFTQKSKPGPDPLDTRRLQGFRLEEYLIGQSIGKGCSAAVYEATMPTLPQ 
NLEVTKSTGLLPGRGPGTSAPGEGQERAPGAPAFPLAIKMMWNISAGSSSEAILNTMSQE 
LVPASRVALAGEYGAVTYRKSKRGPKQLAPHPNIIRVLRAFTSSVPLLPGALVDYPDVLP 
SRLHPEGLGHGRTLFLVMKNYPCTLRQYLCVNTPSPRLAAMMLLQLLEGVDHLVQQGIAH 
RDLKSDNILVELDPDGCPWLVIADFGCCLADESIGLQLPFSSWYVDRGGNGCLMAPEVST 
ARPGPRAVIDYSKADAWAVGAIAYEIFGLVNPFYGQGKAHLESRSYQEAQLPALPESVPP 
DVRQLVRALLQREAS KRP S ARVAANVLHLS LWGEH I LALKNLKLDKMVGWLLQQ S AATLL 
ANRLTEKCCVETKMKMLFLANLECETLCQAALLLCSWRAAL 

SEQ ID NO: 231_AI086865_H 

MEKYERI RWGRGAFGI VHLCLRKADQKLVI I KQI PVEQMTKEERQAAQNECQVLKLLNH 
PNVI EYYENFLEDKALMI AMEYAPGGTLAEF I QKRCNSLLEEETI LHFFVQ I LLALHHVH 
THLILHRDLKTQNILLDKHRMWKIGDFGISKILSSKSTPCYISPELCEGKPYNQKSDIW 
ALGCVLYELASLKRAFEAANLPALVLKIMSGTFAPISDRYSPELRQLVLSLLSLEPAQRP 
PLSHIMAQPLCIRALLNLHTDGREVRGPQQHREQDHQCPLQRGIIMTFGSGSNGCLGHGS 
LTDISQPTIVEALLGYEMVQQVEEALSFTLLGSAPLDQEPLLSIDLGTAHSAAVTGEEDL 
GSGDVNRLPSWERGHLLAGVASSTDVSTFSEGDCKEPDKCCWRHKQCTGHI IYPFASDCV 
RHSLHLHSVNHCNCNSRLKDSSEDSSSSRGAGPTCSHVIESPCFELTPEEEHVERFRYGW 
CKSYRPVSVAVIHHPLYHECGADDLNXKKRKRRRRKSKPPIPTQVGPATASPDLGTSMAT 
GTPDSTAPITIWRSESPTGKGQGSKVIKKVKKKKEKEKDKEEMDEKAKLKKKAKKGQLTK 
KKSPVKLEPSPPDVSRSLSARQLARMSESSPESREELESEDSYNGRGQGELSSEDIVESS 
SPRKRENTVQAKKTGAKPSQARKVNKRKSPPGSNPNLS 

SEQ ID NO: 232_AA836348_H 

MSVLGEYERHCDSINSDFGSESGGCGDSSPGPSASQGPRAGGGAAEQEELHYIPIRVLGR 
GAFGEATLYRRTEDD S LWWKEVDLTRLSE KERRDALNE I V I LALLQHDN 1 1 AY YNH FMD 
NTTLLI ELE YCNGGNLYDKI LRQKDKLFEEEMWWYLFQ I VSAVS C I HKAG I LHRD I KTL 
NIFLTKANLIKLGDYGLAKKLNSEYSMAETLVGTPYYMSPELCQGVKYNFKSDIWAVGCV 
IFELLTLKRTFDATNPLNLCVKIVQGIRAMEVDSSQYSLELIQMVHSCLDQDPEQRPTAD 
ELLDRPLLRKRRRSSTVTEAPIAWTSRTSEVYVWGGGKSTPQKLDVIKSGCSARQVCAG 
NTHFAWTVEKELYTWVNMQGGTKLHGQLGHGDKASYRQPKHVEKLQGKAIRQVSCGDDF 
TVCVTDEGQLYAFGSDYYGCMGVDKVAGPEVLEPMQLNFFLSNPVEQVSCGDNHWVLTR 
NKEVYSWGCGEYGRLGLDSEEDYYTPQKVDVPKALIIVAVQCGCDGTFLLTQSGKVLACG 
LNEFNKLGLNQCMSGIINHEAYHEVPYTTSFTLAKQLSFYKIRTIAPGKTHTAAIDERGR 
LLTFGCNKCGQLGVGNYKKRLGINLLGGPLGGKQVIRVSCGDEFTIAATDEKVLNSKTIR 
SNSSGLSIGTVFQSSSPGGGGGGGGGEEEDSQQESETPDPSGGFRGTMEADRGMEGLISP 
TEAMGNSNGASSSCPGWLRKELENAEFIPMPDSPSPLSAAFSESEKDTLPYEELQGLKVA 
SEAPLEHKPQVEASVTELFAFESQLVTSAESCSNLCWEGNTTDSSCVCVQLSAGGG 

SEQ ID NO: 233_R86668_H, MKK6_H 

MNLLLS YRDVQDYSAI I ELVETLQALPTCDVAEQHNVCFHYTFALNRRNRPGDRAKALSV 
LLPLVQLEGSVAPDLYCMCGRIYKDMFFSSGFQDAGHREQAYHWYRKAFDVEPSLHSGIN 
AAVLLIAAGQHFEDSKELRLIGMKLGCLLARKGCVEKMQYYWDVGFYLGAQILANDPTQV 
VLAAEQLYKLNAPIWYLVSVMETFLLYQHFRPTPEPPGGPPRRAHFWLHFLLQSCQPFKT 
ACAQGDQCLVLVLEMNKVLLPAKLEVRGTDPVSTVTLSLLEPETQDI PSSWTFPVAS I CG 
VSASKRDERCCFLYALPPAQDVQLCFPSVGHCQWFCGLIQAV7VTNPDSTAPAEEAEGAGE 
MLE FDYE YTETGERLVLGKGT YG WYAGRDRHTRVR I A I KE I PERDSRFSQPLHEEI ALH 
RRLRHKNIVRYLGSASQGGYLKIFMEEVPGGSLSSLLRSVWGPLKDNESTISFYTRQILQ 
GLGYLHDNHIVHRDIKGDNVLINTFSGLLKISDFGTSKRLAGITPCTETFTGTLQYMAPE 
IIDQGPRGYGKAADIWSLGCTVIEMATGRPPFHELGSPQAAMFQVGMYKVHPPMPSSLSA 
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FIGURE 1AA 

EAQAFLLRTFEPDPRLRASAQTLLGDPFLQPGKRSRSPSSPRHAPRPSDAPSASPTPSAN 
STTQSQTFPCPQAPSQHPPSPPKRCLSYGGTSQLRVPEEPAAEEPASPEESSGLSLLHQE 
SKRRAMLAAVLEQELPALAENLHQEQKQEQGARLGRNHVEELLRCLGAHIHTPNRRQLAQ 
ELRALQGRLRAQGLGPALLHRPLFAFPDAVKQI LRKRQI RPHWMFVLDSLLSRAVRAALG 
VLGPEVEKEAVSPRSEELSNEGDSQQSPGQQSPLPVEPEQGPAPLMVQLSLLRAETDRLR 
EI LAGKEREYQALVQRALQRLNEEARTYVLAPEPPTALSTDQGLVQWLQELNVDSGT I QM 
LLNHSFTLHTLLTYATRDDLIYTRIRGGMVCRIWRAILAQRAGSTPVTSGP 

SEQ ID NO: 234_PAK6_H 

MFGKKKKKI E I S G PSNFEHRVHTGFDPQEQKFTGLPQQWHS LLADTANRPKPMVDP S C I T 
P I QLAPMKT I VRGNKPCKETS I NGLLEDFDN I SVTRSNSLRKES PPTPDQGAS SHG PGHA 
EENGFITFSQYSSESDTTADYTTEKYREKSLYGDDLDPYYRGSHAAKQNGHVMKMKHGEA 
YYSEVKPLKSDFARFSADYHSHLDSLSKPSEYSDLKWEYQRASSSSPLDYSFQFTPSRTA 
GTSGCSKESLAYSESEWGPSLDDYDRRPKSSYLNQTSPQPTMRQRSRSGSGLQEPMMPFG 
ASAFKTHPQGHSYNSYTYPRLSEPTMCIPKVDYDRAQMVLSPPLSGSDTYPRGPAKLPQS 
QSKSGYSSSSHQYPSGYHKATLYHHPSLQSSSQYISTASYLSSLSLSSSTYPPPSWGSSS 
DQQPSRVSHEQFRAALQLWSPGDPREYLANFIKIGEGSTGIVCIATEKHTGKQVAVKKM 
DLRKQQRRELLFNEWIMRDYHHDNWDMYSSYLVGDELWWMEFLEGGALTDIVTHTRM 

NEEQ I ATVCLS VLRALS YLHNQGVI HRD IKSDSI LLTSDGR I KLSDFGFC AQVS KEVPKR 
KSLVGTPYWMAPEVISRLPYGTEVDIWSLGIMVIEMIDGEPPYFNEPPLQAMRRIRDSLP 
PRVKDLHKVSSVLRGFLDLMLVREPSQRATAQELLGHPFLKLAGPPSCIVPLMRQYRHH 

SEQ ID NO: 235_SURTK106_H 

MNDRNEIQMEAKLQSLTI I AQEILCRFFITLRRHARFLLTKLGRQGMARSGITHSCAVCI 
LCGPSREGDSPVAMGMTRMLLECSLSDKLCVIQEKQYEVI IVPTLLVTIFLILLGVILWL 
FIREQRTQQQRSGPQGIAPVPPPRDLSWEAGHGGNVALPLKETSVENFLGATTPALAKLQ 
VPREQLSEVLEQICSGSCGPIFRANMNTGDPSKPKSVILKALKEPAGLHEVQDFLGRIQF 
HQYLGKHKNLVQLEGCCTEKLPLYMVLEDVAQGDLLGFLWTCRRDVMTMDGLLYDLTEKQ 
VYHIGKQVLLALEFLQEKHLFHGDVAARNILMQSDLTAKLCGLGLAYEVYTRGAISSTQT 
IPLKWLAPERLLLRPASIRADVWSFGILLYEMVTLGAPPYPEVPPTSILEHLQRRKIMKR 
PSSCTHTMYSIMKSCWRWREADRPSPRELRLRLEAAIKTADDEAVLQVPELWPELYAAV 

AGIRVESLFYNYSML 

SEQ ID NO: 236_AA098024_M 

LQEKHLFHGDVAARNILIQSDLTPKLCHLGLAYEVHAHGAISSARSSTIPLKWLAPERLL 
LRPASIRGDIWSFGILLYEMVTLGAPPYPEVPPTSILQYLQRKKIMKRPSSCSHAMYNIM 
KCCWRWSEDSRPLLVQLLQRLEAASRSADDKAVLQVPELWPELYADVAGIRAES I S YSF 

SVL 

SEQ ID NO: 2 3 7_SGK2ALPHA_H 

MNSSPAGTPSPQPSRANGNINLGPSANPNAQPTDFDFLKVIGKGNYGKVLLAKRKSDGAF 

YAVKVLQKKS I LKKKEQS H I MAER S VLLKNVRH P FLVGLR Y S FQTPE KL Y FVLD YVNGGE 
LFFHLQRERRFLEPRARFYAAEVASAIGYLHSLNII YRDLKPENILLDCQGHWLTDFGL 
CKEGVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVS 
QMYENILHQPLQIPGGRTVAACDLLQSLLHKDQRQRLGSKADFLEIKNHVFFSPINWDDL 
YHKRLTPPFNPNVTGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLGFSYAPE 

DDDILDC 
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MDQYCI LGR I ^^^^^^^^^^^gpL^VV^^Q^^^^Q^^^^^^^^^^^^^ 
AASQGLHMQNDDACLGAASPECCLLVKEKCRE 

SE Q ID NO: 239_TESK2 H VGRVWPSS YRALISAFSRLTRLDDFT 
MDRSKRNSIAGFPPRVERLEEFEGGGG^^SQVG^ VQLMNRLSH PNILRYINSG 
CEKIGSGFFSEVFKVRHRASGQVT^LKI^ 

nleqlldsnlhlpwtwvk^ydiavglsylh™ 

DFGI^EKIPDVSMGSEKIAWGSPFW^PEVLRDEPYN^ 
PDYLPRTENFGLDYDAFQHMVGDCPPDFLQLTFNCCNM^ 

QEEEQERDRKLQPTARGLLEKAPG^ 

prtvsvldpyyrprdgaar™^ 

SIS™ 

stsgiglqtqgkqdg 
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FIGURE 2A 

SEQ ID NO: 1 X69117_H BARK2_H 

ATGGCGGACCTGGAGGCCGTGCTGGCCGATGTCAGTTACCTGATGGCCATGGAGAAGAGC 

AAGGCGACCCCGGCCGCCCGCGCCAGCAAGAGGATCGTCCTGCCGGAGCCCAGTATCCGG 

AGTGTGATGCAGAAGTACCTTGCAGAGAGAAATGAAATAACCTTTGACAAGATTTTCAAT 

CAGAAAATTGGTTTCTTGCTATTTAAAGATTTTTGTTTGAATGAAATTAATGAAGCTGTA 

CCTCAGGTGAAGTTTTATGAAGAGATAAAGGAATATGAAAAACTTGATAATGAGGAAGAC 

CGCCTTTGCAGAAGTCGACAAATTTATGATGCCTACATCATGAAGGAACTTCTTTCCTGT 

TCACATCCTTTCTCAAAGCAAGCTGTAGAACACGTACAAAGTCATTTATCCAAGAAACAA 

GTGACATCAACTCTTTTTCAGCCATACATAGAAGAAATTTGTGAAAGCCTTCGAGGTGAC 

ATTTTTCAAAAATTTATGGAAAGTGACAAGTTCACTAGATTTTGTCAGTGGAAAAACGTT 

GAATTAAATATCCATTTGACCATGAATGAGTTCAGTGTGCATAGGATTATTGGACGAGGA 

GGATTCGGGGAAGTTTATGGTTGCAGGAAAGCAGACACTGGAAAAATGTATGCAATGAAA 

TGCTTAGATAAGAAGAGGATCAAAATGAAACAAGGAGAAACATTAGCTTTAAATGAAAGA 

ATCATGTTGTCTCTTGTCAGCACAGGAGACTGTCCTTTCATTGTATGTATGACCTATGCC 

TTCCATACCCCAGATAAACTCTGCTTCATCCTGGATCTGATGAACGGGGGCGATTTGCAC 

TACCACCTTTCACAACACGGTGTGTTCTCTGAGAAGGAGATGCGGTTTTATGCCACTGAA 

ATCATTCTGGGTCTGGAACACGTGCACAATCGGTTTGTTGTCTACAGAGATTTGAAGCCA 

GCAAATATTCTCTTGGATGAACATGGACACGCAAGAATATCAGATCTTGGTCTTGCCTGC 

GATTTTTCCAAAAAGAAGCCTCATGCGAGTGTTGGCACCCATGGGTACATGGCTCCCGAG 

GTGCTGCAGAAGGGGACGGCCTATGACAGCAGTGCCGACTGGTTCTCCCTGGGCTGCATG 

CTTTTCAAACTTCTGAGAGGTCACAGCCCTTTCAGACAACATAAAACCAAAGACAAGCAT 

GAAATTGACCGAATGACACTCACCGTGAATGTGGAACTTCCAGACACCTTCTCTCCTGAA 

CTGAAGTCCCTTTTGGAGGGCTTGCTTCAGCGAGACGTTAGCAAGCGGCTGGGCTGTCAC 

GGAGGCGGCTCACAGGAAGTAAAAGAGCACAGCTTTTTCAAAGGTGTTGACTGGCAGCAT 

GTCTACTTACAAAAGTACCCACCACCCTTGATTCCTCCCCGGGGAGAAGTCAATGCTGCT 

GATGCCTTTGATATTGGCTCATTTGATGAAGAGGATACCAAAGGGATTAAGCTACTTGAT 

TGCGACCAAGAACTCTACAAGAACTTCCCTTTGGTCATCTCTGAACGCTGGCAGCAAGAA 

GTAACGGAAACAGTTTATGAAGCAGTAAATGCAGACACAGATAAAATCGAGGCCAGGAAG 

AGAGCTAAAAATAAGCAACTTGGCCACGAAGAAGATTACGCTCTGGGGAAGGACTGTATT 

ATGCACGGGTACATGCTGAAACTGGGAAACCCATTTCTGACTCAGTGGCAGCGTCGCTAT 

TTTTACCTCTTTCCAAATAGACTTGAATGGAGAGGAGAGGGAGAGTCCCGGCAAAATTTA 

CTGACAATGGAACAGATTCTCTCTGTGGAAGAAACTCAAATTAAAGACAAAAAATGCATT 

TTGTTCAGAATAAAAGGAGGGAAACAATTTGTCTTGCAATGTGAGAGTGATCCAGAGTTT 

GTGCAGTGGAAGAAAGAGTTGAACGAAACCTTCAAGGAGGCCCAGCGGCTATTGCGTCGT 

GCCCCGAAGTTCCTCAACAAACCTCGGTCAGGTACTGTGGAGCTCCCAAAGCCATCCCTC 

TGTCACAGGAACAGCAACGGCCTCTGA 
SEQ ID NO: 2 AA144574_M BARK2_M 

CTGCTTCGTAGTCTACAGAGACCTGAAGCCTGCGAACATCCTCCTAGATGAATATGGGCA 
CGTGAGGATATCGGATCTCGGCCTTGCCTGTGATTTCTCCAAAAAGAAGCCTCATGCCAG 
CGTGGGCACCCATGGGTACATGGCTCCCGAGGTGTTGCAGAAGGGAACGTGCTATGACAG 
CAGCGCCGACTGGTTCTCCCTGGGCTGTATGCTCTTCAAACTTCTGCGGGGCCACAGCCC 
CTTCAGGCAGCATAAAACCAAAGACAAGCATGAGATAGACCGAATGACCCTGACCGTGAA 
CGTGCAGCTTCCAGATGCCTTCTCCCCTGAGCTGAGGTCCCTCTTAGAGGGTTTGCTCCA 
GCGGGACGTGAGCCAGCGGCTGGGCTGCGGAGGAGGAGGGGCACGAGAGTTGAAGGAGCA 
CATCTTCTTCAAGGGCATTGACTGGCAGCATGTGTACTTACGGAAGTACCCGCCACCCCT 
AATCCCTCCTCGGGGAGAGGTCAACGCTGCAGATGCCTTCGATATCGGCTCCTTCGATGA 
GGAAGACACCAAAGGCATTAAGCTGTTGGACTGTGACCAGGACCTCTATAAGAACTTCCC 
ACTGGTGATCTCCGAGCGCTGGCAGCAAGAAGTGGTGGAGACCATCTATGACGCCGTCAA 
TGCTGATACTGATAAAATCGAGGCCAGGAAGAAGGCTAAAAATAAGCAACTTGGTCAAGA 
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GGAAGATTACGCTATGGGGAAGGACTGCATCATGCACGGGTACATGCTGAAGCTGGGGAA 
CCCCTTTCTCACACAGTGGCAAAGACGCTATTTTTACCTGTTCCCCAACAGACTGGAGTG 
GAGAGGAGAGGGCGAGTCTCGGCAAAGTCTACTGACCATGGAACAGATCATGTCTGTGGA 
GGAGACCCAGATTAAAGACAGAAAGTGCATCTTACTCAGGATAAAGGGAGGGAAGCAATT 
TGTCTTGCAATGTGAGAGTGACCCCGAGTTTGCACAGTGGCTGAAGGAGCTGACCTGCAC 
CTTCAATGAGGCCCAGAGACTGCTGCGCCGTGCCCCCAAATTCCTCAACAAACCACGGGC 
CGCCATCCTGGAGTTCTCCAAGCCACCACTGTGTCACAGAAATAGCAGCGGCCTCTGAAC 
CACAGAGCAGCGGGGCCTGAAGGAGGGGCCCCAGCTCTTCAGCCCAGGAGTGGAACGAAG 
CCACGGGGAACCGTGTGGGGCTAAGACACAGTGTTTCTGAGCACTGACGGGGCTGCTCCA 
AGCCGAGGAGGCTCAGGACACCAGGGCGGCCTTCTGGGAGCTGGGACATCCTCGGGGCTG 
TCCTATCCACACTCGAAATTACTGAAGAAGCAGAGGCATTCTGCTGTG 

SEQ ID NO: 3 AA826850_H 

GAAGAGGATGGGCTCGTCCATGTCGGCGGCCACCGCGCGGAGGCCGGTGTTTGACGACAA 

GGAGGACGTGAACTTCGACCACTTCCAGATCCTTCGGGCCATTGGGAAGGGCAGCTTTGG 

CAAGGTGTGCATTGTGCAGAAGCGGGACACGGAGAAGATGTACGCCATGAAGTACATGAA 

CAAGCAGCAGTGCATCGAGCGCGACGAGGTCCGCAACGTCTTCCGGGAGCTGGAGATCCT 

GCAGGAGATCGAGCACGTCTTCCTGGTGAACCTCTGGTACTCCTTCCAGGACGAGGAGGA 

CATGTTCATGGTCGTGGACCTGCTACTGGGCGGGGACCTGCGCTACCACCTGCAGCAGAA 

CGTGCAGTTCTCCGAGGACACGGTGAGGCTGTACATCTGCGAGATGGCACTGGCTCTGGA 

CTACCTGCGCGGCCAGCACATCATCCACAGAGATGTCAAGCCTGACAACATTCTCCTGGA 

TGAGAGAGGACATGCACACCTGACCGACTTCAACATTGCCACCATCATCAAGGACGGGGA 

GCGGGCGACGGCATTAGCAGGCACCAAGCCGTACATGGCTCCGGAGATCTTCCAXTCTTT 

TGTCAACGGCGGGACCGGCTACTCCTTCGAGGTGGACTGGTGGTCGGTGGGGGTGATGGC 

CTATGAGCTGCTGCGAGGATGGAGGCCCTATGACATCCACTCCAGCAACGCCGTGGAGTC 

CCTGGTGCAGCTGTTCAGCACCGTGAGCGTCCAGTATGTCCCCACGTGGTCCAAGGAGAT 

GGTGGCCTTGCTGCGGAAGCTCCTCACTGTGAACCCCGAGCACCGGCTCTCCAGCCTCCA 

GGACGTGCAGGCAGCCCCGGCGCTGGCCGGCGTGCTGTGGGACCACCTGAGCGAGAAGAG 

GGTGGAGCCGGGCTTCGTGCCCAACAAAGGCCGTCTGCACTGCGACCCCACCTTTGAGCT 

GGAGGAGATGATCCTGGAGTCCAGGCCCCTGCACAAGAAGAAGAAGCGCCTGGCCAAGAA 

CAAGTCCCGGGACAACAGCAGGGACAGCTCCCAGTCCGAGAATGACTATCTTCAAGACTG 

CCTCGATGCCATCCAGCAAGACTTCGTGATTTTTAACAGAGAAAAGCTGAAGAGGAGCCA 

GGACCTCCCGAGGGAGCCTCTCCCCGCCCCTGAGTCCAGGGATGCTGCGGAGCCTGTGGA 

GGACGAGGCGGAACGCTCCGCCCTGCCCATGTGCGGCCCCATTTGCCCCTCGGCCGGGAG 

CGGCTAGGCCGGGATGCCCGTGGTCCTCACCCCTTGAGCTGCTTTGGAGACTCGGCTGCC 

AGAGGGAGGGCCATGGGCCGAGGCCTGGCATTCACGTTCCCACCCAGCCTGGCTGGCGGT 

GCCCACAGTGCCCCGGACACATTTCACACCTCAGGCTCGTGGTGGTGCAGGGGACAAGAG 

GCTGTGGGTGCAGGGGACACCTGTGGAGGGCATTTCCCGTGGGCCCCCGAGACCCGCCTA 

GATGGAGGAAGCGCTGCTGGGCGCCCTCTTACCGCTCACGGGGAGCTGGGGCCATGGATG 

GGACAGGAGTCTTTGTCCCTGCTCAGCCCGGAGGCTGTGCACGGCCCTCGTCACAAGGTG 

ACCCTTGCAGCACAGGCCGCGGGTGCCCCAGGCTCGGCTCAGTTCTTGGAGGTCAAGGGC 

ATGGGTTGGGGTAGTGGGTGGGGAGGTGAATGTTTTCTAGAGATTCAAACTGCTCCAGCA 

ATTTCTGTATAGTTTTCACCTCTGAGAATTACAATGTGAGAACCGCTC 

SEQ ID NO: 4 AA960957_H 

GTCCCACATCCCGCATCCGGCATCCCAGCGGCCGGGCATGTAGCAGCGGCAGCAACGGCG 
GAATATGGGCGGGAACCACTCCCACAAGCCCCCCGTGTTTGACGAGAATGAGGAAGTCAA 
CTTTGACCATTTTCAGATTCTGCGGGCCATTGGTAAAGGGAGTTTTGGAAAGGTATGCAT 
CGTGCAGAAGCGAGACACTAAGAAAATGTATGCAATGAAGTACATGAACAAGCAGAAGTG 
CATCGAGAGGGATGAGGTTCGGAATGTTTTCCGGGAGCTGCAGATCATGCAAGGGCTGGA 
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GGTGGACCTGCTCCTGGGAGGCGACCTGCGCTACCATCTGC^ 

AGAGGGGACTGTGAAACTCTACATCTGT^ 

GTACCACATCATCCACAGAGACATCAAGCCAGACAA^^ 

TGTTCACATTACAGACTTCAACATAGCGACGGTAGTG^ 

CATGGCTGGCACCAAGCCCTACATGGC^ 
CCCCGGATACTCGTACCCTGTCGACTGGTGGTCCCTGGGC^ 

GCGGGGCTGGAGGCCGTACGAAATCCA^ 
CGTGCCCTACTTGGCCGACATGAACTGGGACG^ 

GCTCTTGGACACCGACAGCCGAGGGGGAGGCCAG^ 
CAACAACAACCTCCTCACCCACACCTGCACCCGTC^ 
TTGCTGCTCAACAGGACTGCACTCGTCTCTGCCCTGCC^ 
CCTGATGGTCCCTGTCTCACCCCTGAAAACATCAGATGC^A 

GCTGGGAAGCCTGGGTTCTGGTCCCATCTCCA^ 

GTCACGCCCTCTCTGTGCCTCCGTTTTCTGCATCl^ C ATGAAAAACCTT 
AAGGTCTCAGCGCTGCGGTCT^C^CTCCCCTCATTTAA^^ ^^^^^^^^^ 

TGGCATTTCTTGTTTGGAATRAACTATTTCTTGGACATTCCTTC 
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SEQ ID NO: 5JTBK1 H ' CGCCGGCG GTGGCGCGGCGGAGACCCGGCTG 

CATGTGGGAGTTTATACACTGTTTTAGAA^^ 

CTGAATTCTTAATTGTTTTGCG^ 
GTATAGTGCACCGTGATATCAAGCCAGGAAATATCA^ 

AGTCTGTGTACAAACTCACAGATTTTGGTG^ 

TTGTTTCTCTGTATGGCACAGAAGAATATTTGCACCL AGC ATTGGGGTAA 

TAAGAAAAGATCATCAGAAGAAATATGGAG^ 

CATTTTACCATGCAGCTACTGGATCACTGCCATTTAGACCC^ 

ATAAAGAAGTGATGTATAAAATAATTACAGGAAA^ 
TCCCTAAAACTACTGAGGAAAACCCTAT^ 

TAGGATTAATATATGAAAAAATTTCCCTCCCIAAM GCCTGCAGAAT TGCCA 

GGGATGCTAGCATGGCTAAGGCAATAACAGG^^ 

GTACCTTACTGCTTTATCAGGAATTAATGCG^ 

TTAAAGATGATTACAATGAAACTGTTCACAAAAA^ GTT GATGAAGATCAACC 
TCTGTATCAGAAACATTGAAAAAACTGTGAA^GTATATGAA^ 

TGG^GCGGCAGAGTTAGGTGAAATTTCAGACAT^ 



AAGCAGAACGTAGATTAGCTTATAATGAAGAACAAA^ ^ TGTGTTAAA AAGTATG 
— TATTACCATGCCACAAAAGCTATGACGCACTTTACAGAi GGAAACAGT 
GCATTTTTGAATAAGTCAGAAG^AATG^ 

TTATCGCTGACTARTCAGTGTTTTGATR 1 i ^ G ^ Tm 

Saatgagttacaagaaac^c^g^^tta^g^^ ctct ^ tr 



TGAAGAAATTAAAGGAAGAGATGGAAG^^^J ^ MCGTTGACTG TCTTT 
^TTTAGAAAGGTTTGGCTCTTTAA^TCGATGGTG^ 

AGCTTTCTAATAGAAGTTTAAGAAAAOTTCCGTll GTMTT GATGTGGTCG 
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GTTTTAAGCTGTATATTTCTTTAATTCTGCTTACTATTTCATGGAAAAAAATAAATTTCT 
CAATTTTAAAAAA 

SEQ ID NO: 6_AA3 05176_H 

TGGCTGCTCGCGGAGGGGCAGTGTACGCGGGGCCGCTGTAGGCTGTCCAGCGATGGATCC 
CACCGCGGGAAGCAAGAAGGAGCCTGGAGGAGGCGCGGCGACTGAGGAGGGCGTGAATAG 
GATCGCAGTGCCAAAACCGCCCTCCATTGAGGAATTCAGCATAGTGAAGCCCATTAGCCG 
GGGCGCCTTCGGGAAAGTGTATCTGGGGCAGAAAGGCGGCAAATTGTATGCAGTAAAGGT 
TGTTAAAAAAGCAGACATGATCAACAAAAATATGACTCATCAGGTCCAAGCTGAGAGAGA 
TGCACTGGCACTAAGCAAAAGCCCATTCATTGTCCATTTGTATTATTCACTGCAGTCTGC 
AAACAATGTCTACTTGGTAATGGAATATCTTATTGGGGGAGATGTCAAGTCTCTCCTACA 
TATATATGGTTATTTTGATGAAGAGATGGCTGTGA7^ATATATTTCTGAAGTAGCACTGGC 
TCTAGACTACCTTCACAGACATGGAATCATCCACAGGGACTTGT^AACCGGACAATATGCT 
TATTTCTAATGAGGGTCATATTAAACTGACGGATTTTGGCCTTTCAAAAGTTACTTTGAA 
TAGAGATATTAATATGATGGATATCCTTACAACACCATCAATGGCAAAACCTAGACAAGA 
TTATTCAAGAACCCCAGGACAAGTGTTATCGCTTATCAGCTCGTTGGGATTTAACACACC 
AATTGCAGAAAAAAATCAAGACCCTGCAAACATCCTTTCAGCCTGTCTGTCTGAAACATC 
ACAGCTTTCTCAAGGACTCGTATGCCCTATGTCTGTAGATCAAAAGGACACTACGCCTTA 
TTCTAGCAAATTACTAAAATCATGTCTTGAAACAGTTGCCTCCAACCCAGGAATGCCTGT 
GAAGTGTCTAACTTCTAATTTACTCCAGTCTAGGAAAAGGCTGGCCACATCCAGTGCCAG 
TAGTCAATCCCACACCTTCATATCCAGTGTGGAATCAGAATGCCACAGCAGTCCCAAATG 
GGAAAAAGATTGCCAGGTTTGAGGGACATTTATCTTAATGAAAATCAATTATGTATGTCA 
AATGAATGTGAGAAATATTATACCTTTTCATATAAATTCCATAAAGAT^ATGAAATTGTTA 
CATGAATGGCAGTCATAGTATTAATCAGAAATTCATTTTCCTGCACATTCTGTCAAATTC 
TTTTGAAATATTTCATTTCTCATTCAATTGTGACATTGTTCTTACTTGATTATATAATGA 
GATTCTTGCAGTAAATTGATAATAAATGCTTGGCTTCTGTGTATCTAGGTGGACCTCACT 
TGTTTTTAGAAGTCCTTCCCATGATACAGACATTGGCTTGTTGGTTTTGTTTTATTTTGT 
TTTTAACATATGTCATTTAAAAACTCATATTACCTCCTTTT 

SEQ ID NO: 7_AA116841_M 

CCACGCGTCCGATCCCATGGCCAGAAGGCGAAGAAAAGCTATCTGATAATGCTCAAAGTG 
CAATGGACATGCTTTTAACCATTGATGATTCAAAGAGAGCTGGAATGAGAGAACTAAAAC 
AGCATCCTCTCTTCAGTGAAGTGGACTGGGAAAATCTGCAGCATCAGACTATGCCTTTCG 
TACCCCAACCAGACGACGAAACAGATACATCCTATTTTGAAGCCAGAAATAATGCTCAAC 
ATCTGACCGTATCTGGGTTTAGTCTGTAGCACATGCGTGTCATTTTTATCTAACTTGTGA 
TATAGAATTAAGTTTTACAGTAATATGCTACTTAATACTAGATTGGTCTAAATGGGATAA 
AAGTCATTATTTTACCCAGACTGAACAGCTTTTAATTACTAAGTACAACAGTTTTTACAG 
AATTAAAATACTATAAGCAATATAATCAGTAATTAATCTTTACCTTAGAACTGTATATAA 
GCCATAATAGCTTTTTTCATCTTATTTATTCACTGCACTTTATGAAGAGCAAAGTATCAA 
TAAACTAAAACACTACCACTCTAAATAGAGGGAGTGAGCCGT 

SEQ ID NO: 8_AA2 56100_H 

AGGGAGCTGACGGGCGCCCGGCCGGCTGCGGTCCGTGCGGAGGCTGAGCCGGCCGCGGGC 
GCGACCGGAGGCAGTTTCCGTTACTATGGCAATGACGGCAGGGACTACAACAACCTTTCC 
TATGAGCAACCATACCCGGGAAAGAGTGACTGTAGCCAAGCTCACATTGGAGAATTTTTA 
TAGCAACCTAATTTTACAGCATGAAGAGAGAGAAACCAGGCAGAAGAAATTAGAAGTGGC 
CATGGAAGAAGAAGGATTAGCAGATGAAGAGAAAAAGTTACGTCGATCACAACACGCTCG 
CAAAGAAACAGAGTTCTTACGGCTCAAAAGGACCAGACTTGGCTTGGATGACTTTGAGTC 
TCTGAAAGTTATAGGAAGAGGAGCTTTTGGAGAGGTGCGGTTGGTCCAGAAGAAAGATAC 
AGGCCATATCTATGCAATGAAGATATTGAGAAAGTCTGATATGCTTGAAAAAGAGCAGGT 
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GGCCCATATCCGAGCAGAAAGAGATATTTTGGTAGAAGCAGATGGTGCCTGGGTGGTGAA 
GATGTTTTACAGTTTTCAGGATAAGAGGAATCTTTATCTAATCATGGAATTTCTCCCTGG 
AGGTGACATGATGACATTGCTAATGAAGAAAGACACCTTGACAGAAGAGGAAACACAGTT 
CTACATTTCAGAGACTGTTCTGGCAATAGATGCGATCCACCAGTTGGGTTTCATCCATCG 
GGATATTAAGCCAGACAACCTTTTATTGGATGCCAAGGGTCATGTAAAATTATCTGATTT 
TGGTTTATGTACGGGATTAAAGAAAGCTCACAGGACTGAATTTTATAGAAATCTCACACA 
CAACCCACCAAGTGACTTCTCATTTCAGAACATGAACTCAAAGAGGAAAGCAGAAACTTG 
GAAGAAGAACAGGAGACAACTGGCATATTCCACAGTTGGGACACCAGATTACATTGCTCC 
AGAAGTATTCATGCAGACTGGTTACAACAAATTGTGTGACTGGTGGTCTTTGGGAGTGAT 

TATGTATGAAATGCTAATAGGATATC CACCTTTCTGCT CTG AAACACCTCAAGAAAC ATA 

CAGAAAAGTGATGAACTGGAAAGAAACTCTGGTATTTCCTCCAGAGGTACCTATATCTGA 

GAAAGCCAAGGACTTAATTCTCAGATTTTGTATTGATTCTGAAAACAGAATTGGAAATAG 

TGGAGTAGAAGAAATAAAAGGTCATCCCTTTTTTGAAGGTGTCGACTGGGAGCACATAAG 

GGAAAGGCCAGCAGCAATCCCTATAGAAATCAAAAGCATTGATGATACTTCAAATTTTGA 

TGACTTCCCTGAATCTGATATTTTACAACCAGTGCCAAATACCACAGAACCGGACTACAA 

ATCCAAAGACTGGGTTTTTCTCAATTATACCTATAAAAGGTTTGAAGGGTTGACTCAACG 

TGGCTCTATCCCCACCTACATGAAAGCTGGGAAGTTATGAATGAAGATAACATTCACCCA 

TAACCAAGAGAACTCAGGTAGCTGCATCACCAGGCTTGCTTGGCGTAGATAACAATACAC 

TGAAATACTCCTGAAGATGGTGGTGCTTATTGACTACAAGAGGAAATTCTACAGGATTAG 

GATTTCTAAGACTACTATAGGAATTGGTTGGCAGTGCCAGCTGGCTCTTTTTTTTAATAT 

TTTATTATTTTTGTTAACTTTATTATATGAAGGTACTGGAATAAAAGGAACAGACATCCC 

TTTCTAACTGCACTGCCTACATGCGTATTAAGGTCCATTCTGCCTGTGTGTGCTGTGGCT 

TTGAACTGTAACACCTCTAATCAATTCAGGAGAAACACATATCATTTAAAGCAACATAGG 

CTAACCTGTAGGTAACACTGCAGTATTGATGTTTTACTGCAAATCTTATGGGTCTAGATA 

ATCAGTAAAAGCCATCTTCCATAGTTGGTGTTAGAACATTGCCCTATTGGTTTGGACATC 

TGTAGAATATATATGAAGACAATTTCTGTAATGGTTTTAAGAGATTTAAAAAGAAATTCA 

CTGGTTCTTTACAAAATAGAATTTATCATCAAGTTATTACACAAACTTCACAGTAAGGAG 

TGACAAGTTTATAATAAGGAAGACAAAGTTTAACACCTTCACTCAAGCACTCCACTAATA 

TATTTACGTTGCATTCAGAAATACTGATGACCTTCATATACGTAGTCTGTATACTCATAG 

GGAGATGTACTGTATTATATAACATGTAAAGTTGATTTTCTTGTGACAAGAGAACTTCTT 

TTTTTAACAAGAGGACATGGCATTATTTTAATTTGATTATGGTGAGTTGAATTTAAGACA 

TGACCATGAAGGCTGCTTGTAGAATTAGTGTATTTTTATTAAACTATTTTTTTAAATGTC 

AAACTTCTATCATGTAAATGGACTTATAGAGAACAAAAAGCTATTTACTTTGGTTTTCTA 

GAAAGTTGTTACATATCATGGCTGGTTAACTTTTATTTCTTTTGATGAAAATTTTTCCTT 

TGATAGTACTTGTATTATTGTGCCATTATTTTCTTATGCTCCAAATGTACCAAAGATCTT 

GAACAGAGTGGATGTTCACAACTGAGTAGAATTTTCCTTTCCTGTGGGCATGCTGTATTC 

AGACCTGACAGATCTTTGATAGAGGTCAGCTTATTAAAGGGCAATATTGTTCTTGTTTAG 

CTACATCACTGTGGTGAATATAGATGGAATTAAGGAAGTAAATGCAGGCCAGGGGGTTGT 

GATGAGAGGATAGGGGAGATAATATCAGCATCAAATTCTTTGGGTATCTCTCTAAGAATT 

AAATAATCTTTTCTAGCTTAATATTTTAATTCTAATTCAAACAACTCTGAGGTTTTGGTT 

TCATTAGTAATAGTTGAGGAATAATATACTAGCAAAGAATGGCCTAATGTTTGTCATAAC 

TGTTAATGGATGAAATTTTTTAAAGATACAACCATGATAACCATTATAAATGATCTATGA 

TCAAAATCTAAAGTGATGAATTATTTGTAGGAATGTCTTCCTAATGGGGAAGAATTGCAT 

AGGAGCATTATGCAAATCTACACAAGCTTTTATAAATGTTGCTGCTGGGTAGCTCCACAG 

TGTTTCATAAGGCCATCCTGTTTCCCCCAACTCCCCCATTTTTGGTTTGTTTCTTTTTAA 

ATATTTGTTGAGTACTTACGTGTTTATCTAACAGTTCACTTCCATTTTTCTAGTCTGGAT 

TTTTTGAGTATTTAGGAAAGAGAGCTATTAAAAACTCTGGGGATTTCTCAATGTGACTAA 

CTCTAATTTTTCTAATTATAACTGCCTTTAATTAACATAATATTAACTTTTGCTGAGGTT 

Stgagattttctcaccccacatcgctcccctttttttaaaaaggactgttttgctagtg 
SSaatgaataggtaagatatgagataattgcaacattgtcctagttct^^ 



/II) 
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FIGURE 2G 

CTATTCTTGAAATGGTATTGAAAAATACCGTTAATTCAAATTGACAGAGATTGATAAAAA 

GAAACTGATTTACCTAAGTTTACTTTTTAATTGCATAATAGAGCATTTTTTGTTTTGAGT 

TCCCTCATTCTTATTACCAGAAAGAGCTTGCAAATAGTTTTACTTTCTTGGCACTGGAAG 

GGTAGTTCTGGAAAGCTACTTTGTTGAGAGTCTCATTCTTCCCTGGAGTTAATAGAGTGA 

TTCACAATCTTTGGGGTTTTCTCCTCATCAAAAGCATTTCTTAAGTGCCTATCTAAAAGC 

AATTAAAGACTGTGTCTGCCCTTTAGAAGCTAAGAATTTGATTCATGATGCAAATTAACT 

AGATAATTTGCAAAGTACCCTTGAGATTGAATTTTCTCTATTATATATTTCCCATATTTC 

AGGTGAATAATTTAATTTAAATGACAAAACCCTATCTAGTCAACTGGGCATAATGACATT 

TTCTTTAAATTAGACTCTATTTTGAATTAAAAGAGTTTTATTATAAACCGTGTGTTTTTG 

GTTTTTCTAAGTATATAGAAAGCTTGTATAATTCAGATTTATCAATTTCCTGATTTAATG 

TAGACTTTGACTTTTTTATTAAAAACCTTTGTATTAAAGCAAGTTATGTTATTTTTCTTT 

TATGCATTTATTACTAACATAGCTTTAAATCTTTAAATGTATTGAAGCATTGTGCTGTCT 

GAAAATAAGGAATTGCTTATAAACCAGCCACTTCTGAATACAATATGTAGCTGATTTAAT 

AAGCTAGTTAGTGAATGGAAAATAAGTGTGGAGTATTAAAAATGTTCTTTGGTTGGTAAG 

GCCTAAGATAGGGTTTCATTTATTTCTATACTTTTTCTGTTTTTTAAACACCTGCATATT 

TTTATGTAAATCTCTAAATTTAAAATATTTTAAGTACATTTATTTTTGGTGTTTTATTGT 

ATAAAACCTTAGACAATCAATCAGTCAGTCTTTACTGACAGGAGCAGCAGCTATCTGTCT 

TTTGCTGATCTACAAATAAATGAATTGAGAATTTAGTCCATAGAGGTCCCTGGCTACCAA 

ACACATTCTCCTTTGAATTGTTAAAATTCAGAACATTCAAAATAACTGTTTTGCTACAAC 

CCATGATTATTTTCCTGTTGTGTTTATTTAAATTTACTTTCTCTTTAGAAGTGCACTTAT 

TTCTGAAAAATCTTAATGAAACAAACGCTTAGAACAAATATAAATATGAGACACTTGGGA 

rTACTAGAGATATTTTAGATTTTTATGAAAAAAATGTGAGGGGATATTGCTGCTTTAAAA 

AGGAATAAAGTAATAAAAATATATCTCAGCTATTTTTTTAAAGCAATATAATTCAGCAAT 

TGTCTAGAAAAGTAATCATGAGGCTACTGAGTTTGGTGTTCAGTTACTGAGTTTCAAAAA 

TGTTTTGGTGGCATGAGGACAAAATTTCATTGAAGGTAAGATAAGAATAAAAACTATGTT 

TAC 

cpn TD NO- 9 AA21082 5 H 

CACGAGGGCTACTGGCGCCTGGCGACCCTCCCTGCCCCCCACCCAACCCCGCTCCGGCAA 

CGCCCCCTTCCTCACGGCTCCCGACCGAACTTTTCTCCAACTTCTGCGACTCGTGAGATT 

CCCTTCTACCCACTCCGGCCCTCGGGACCCCTCTGCCCATCCCCTGGCCGGTCGGGTCCC 

TGCGAACCCCTTTATCTCTGGAATCCACTCGGTCCCCGACTCAGAGACTCCTGCCCTCCA 

CCCCCAAGGACCCCGCCATCCTCAGGTCCCCTCCGCCTGCCAGATCTTTTCTCGGATCCC 

CGCTCTCCCACCACCTGCTCACGAGATCCCGCGGATCTAGAACCCAGGGTCCCCCGGGGC 

CCCCCGGCGGGTCCCGGGTGGGCTCCAGGCGGGCGGTCCCCGGCCTCCCCCCATGGCCAC 

CGCCCCCTCATTATCCCGCCGGGCTCCCTGGCTCTCCCGGGCCGGGGTCTCCTCCGCCCC 

CCGGCGGCCTAGAGCTGCAGTCGCCGCCACCGCTACTGCCCCAGATCCCGGCCCCGGGTT 

CCGGGGTCTCCTTTCACATCCAGATCGGGCTGACCCGCGAGTTCGTGCTGTTGCCCGCCG 

CCTCCGAGCTGGCTCATGTGAAGCAGCTGGCCTGTTCCATCGTGGACCAGAAGTTCCCTG 

AGTGTGGCTTCTACGGCCTTTACGACAAGATCCTGCTTTTCAAACATGACCCCACGTCGG 

CCAACCTCCTGCAGCTGGTGCGCTCGTCCGGAGACATCCAGGAGGGCGACCTGGTGGAGG 

TGGTGCTGTCGGCCTCGGCCACCTTCGAGGACTTCCAGATCCGCCCGCACGCCCTCACGG 

TGCACTCCTATCGGGCGCCTGCCTTCTGTGATCACTGCGGGGAGATGCTCTTCGGCCTAG 

TGCGCCAGGGCCTCAAGTGCGATGGCTGCGGGCTGAACTACCACAAGCGCTGTGCCTTCA 

GCATCCCCAACAACTGTAGTGGGGCCCGCAAACGGCGCCTGTCATCCACGTCTCTGGCCA 

GTGGCCACTCGGTGCGCCTCGGCACCTCCGAGTCCCTGCCCTGCACGGCTGAAGAGCTGA 

GCCGTAGCACCACCGAACTCCTGCCTCGCCGTCCCCCGTCATCCTCTTCCTCCTCTTCTG 

CCTCATCGTATACGGGCCGCCCCATTGAGCTGGACAAGATGCTGCTCTCCAAGGTCAAGG 

TCCCGCACACCTTCCTCATCCACAGCTATACACGGCCCACCGTTTGCCAGGCTTGCAAGA 

AACTCCTCAAGGGCCTCTTCCGGCAGGGCCTGCAATGCAAAGACTGCAAGTTTAACTGTC 
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FIGURE 2H 

ATGTGCCGATGGAGGAGGCCACCGATTTCAG^^ 

AGTCAGAGGACTCCGGTGTCATCCCTGGCT^ 

AGGAGGAGGAAGGCGAGGGAGGCAAGGCCCAG^^ 

GGGTGGTGCAATCGGTGCGACACACGACGCGG 

GGGTGGTTCATTAC^GCAACAAGGACACG^ 

GCAAGTGTATCACGCTCTTCCAGAACAACA^ 

TGTCAGAAATCCTCACGGTGGAGTC^^ 
ACCCACACTGCTTTGAGATCGTCACTGCC^TGCCAC^ 

GCGGGACTCCGGGTGGGCCAAGTGGGCAGGGGG^ 

CAGCCATCCGCCAGGCCCTGATGCCCGTCATCC^ 

ACGCGCCCCACAGAC^GCTTCTCTGAGCATCl^ 

AAGTGTTTGTGGTGATGGAGAAGCTGC^ 

WaTCAGGAGAGCCATTTCCTCA^TGMGCTG^ 

TCGGCGAGAAGTGGTTGCGCCGCTCAGTGGTG^ 

TGCTGCTCAACCAGGGCTACAACCGCTCGCTG^ 

ACGTCAGCCTCAGCGGCACCTTCCCTTTCAAC^A CACATCTC AGCTGGAGCCATTG 
AGAACGCCGGGTTGATGTACCCGGCGAGOT 

CCATGGCCTTGATCTCAGCAAAA 
GAATTTGCAGGAAAA 

SEQ ID NO: 11_AA316804 H CTGTATTAC CCACAGCTATTCCT 
TCTTTATCTGCTGTCAAGGATCTTGTGTGCTC^ 
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FIGURE 21 

CTTTCAGCTTTAGCCACAGTAGAA^ 

TCTTACAAAGCTCCTACTTTCTC^^CTGTGGTGft^™ TCTG ccrrCAAGATT 
GAAAAGATGGTAATGTGCAGAGTGAAAGTTCCACA^ 

GAAGAGCCATCACCCCCAGAAGATA^^^ 
GAAAGAGATGAAGAAGCCGTTAAAACA^T^ 

ATGAGGGTTGTACAATCCATCAAGCACACA^GAGGAAGAGCA GGAGA CTT 
GGGTGGATGGTCCATTACACCAGCAGGGATAACCTGAGAAAG^G 

GACAGCAAATGTCTAACATTATTTCA^ 

CCACTTTCAGAAATTCTCCGCATATC^ 

AGCAATCCACACTGTTTTGAAATCATTACTGAT^ 

AATGGGGACAGCTCTCATAATCCTGTTCTTGCTGCCACTG^ 

CAGAGCTGGGAAAAAGCAATTCGCCAAGCCCTCATGCCTGTTAC 

TGCACTTCTCCAGGGCAAGGGAAAGATCACAAAG^ 

Stgtcagattcaggagaatgtggatatc^ 

GTGCTTGGTTCAGGCCAGTTTGGCATCGTTTAT^ 

GATGTGGCTATTAAAGTAAT^^^ 

CGTAATGAAGTGGCTATTTT^^ 
ATGTTTGAAACCCCAGAACC^GTCTTTGTAGTAA^ 

GAAATGATTCTATCCAGTGAGAAAAGTCGGCTT^^ 

acacagatacttgttgctttgaggaatctgcat™ 

AAGCCAGAAAATGTGCTGCTTGCATCAGCAGAG^CCATTTCC 

TTTGGATTTGCACGCATCATTGGTGAA^GTCATTCAGG^ 

GCATACTTAGCCCCTGAAGTTCTCCGGAGCAAAG^ 

TCAGTGGGAGTTATCATCTATGTGAGCCTCAGT^ 

GATATAAATGACCAAATCCAAAATGCT^ 

^^^^^^^^^^ 

CCTAATCC AGATGATATGGAAGAAGATC CTTAA 

CCCKCCCTGGGCCTGGCCCAGCTGAGCCTGTGGCCTCAGG^ 
CAGCTCAGGGCTCGGCACCTAGAGGCTCTCCGG^^ 

AAACAOK^CTGAGAACATGACC^^CGCCAGTG tcoccctgctgcg g 
CTCCTTGCAGCTGCCCAGCAGATGCTGCGGGAC GCCTGAGCTAC TGGCG 
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CCACTGGAGCGAGCCTC^ 

ATAGAGAGCCTGTACTGCGAGAAGCGGATCCTGGA^ 

TTCCTGCTCTCCCTCCTTGTCTGCTTCCAGACCTCCAG 

TTTGTGCCTGGTGGTGACCTCATGATGCAGATCCACGA^ 

GCCCGCTTCTACGTGGCTTGTGTTGTCCTGGGGCTGCAGT 

ATTTACAGGGACCTGAAGTTGGATAACCTTCTGCTGGA^ 

GCAGACTTTGGACTCTGCAAGGAAGGGATC^ 
GGCACCCCGGAGTTCCTGGCTCCCGAGGTGC^ 

TTTCTGTCGGTGCAAGGGCTTGAGTTCATTCAGAAGCTCCTCCA^ 

CGCCTCGGGGCAGGTGAGCAGGATGCCGAGG^AGATCAAG 

ACCAACTGGCAAGCCCTGCTCGCCCGCACCATCCAGC^C 

TTTGTGTCAGAG CGATTCCTGGAACCCTGA 

SEQ ID NO: 13_AI021023_M P ^^ rrATTCCTGAAGATC GCAGACTTTGG 
GCTGAAGTGGGATAACCTTCTGCTGGATGCCCAGGGA 

ACTCTGCAAGGAAGGGATCGGCTTCGGGGACCGG^CTA 

CCTGCGCTACTTTGAGGGCGAGTTCACAGGGCTGCCG^ 

CCACAGCCTCCTCACTGCCCGCCAACAGGCCGC^ 

GCGATTCCTGGAACCCTGAGGGCATCTCCTGGC^ 
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TrTCACTGGGCAAAGTGTGTCCCTTCCCCCTCCAGCTCGCCCTCTTCTACCTCCCAGCGA 
ACTTGCTTTATATTAAATTTGTAAAAGTG 

ACC. 



ATCAGGTCAGATCTGGAAGAACTCTGGGAA 



CTACGGGGGCACCACTATCTGCACCAG 



GAATCCCTAAAGCCAGCCCCAGT. 



ACTGGTAGAGAAGCCTCTGCCAGAGTGGCCAGTGCCT 



CAGTTCATCAACCTCTTTCTACCAGAG' 



TTTCCCATTAGGCCCATTAGGGGGCAGCAGCAG 



CTGAAGATTTTAGGCCTCGTGGCT. 



•AAAGGCTCCTTTGGAACTGTCCTCAAGGTGCTAGAT 



r 'GCACCCAGAAAGCTGTATTTGCAGTGAAGGTGGTG 

SSSgtga^cagtgcaaagaggaggttag 

^^SrTTGGGGGACAGCTGGCAGGGAAAACGGCACCTTTTCATTATGTGTAGCTAC 



rrTCTCTTTGCTGCCGAGTTGGTGCTGGTACTGTGTTATCTCCATGACTTGGGCATCATG 
CTTCAGTACATGGCCC^GAGGTCCTAAGTGGAGGACCTTACAACCATGCTGCTGATTGG 



Epp™ppCTQQQxGTC^GCTTTTCTCTCTGGCGACTGGAAAGTTTCCAGTGGCTGCAGAG 

SS?g?ggcSgSggc«gtgtgacg 
"aaccagggcctctcactc 
vcgttatctgcatcacttc 

cctatccctgcttga 

atgg^ttctttaggatagacagtaaggatagcgcaagtgaactcctggga^ 



reiACCAGGGCCTCTCACTCCTGCTCCATGAGCTCTTATGCCAGAACCCCCTCCATCGT 

TCCAGGTCCACCC] 

GAGCTCCTACAGAAGCAGCCAGTGAACTTTGTCACGGAGACACAAGCTACCCAGCCCAGT 



GGAGAAAAATTGTATAGTCTAAAATCAGAACCTTTGAAACCATTCTTTACTCTT^ 

taggagtttt; 
TTTG atgatgtcagt™^ 



rr-ar arArTGCTTCTAGGAGTTTTAATACTAGTGAAAGCAAGGTAGAGTTTAAAGCTCAG 

gacacS?Sgc^ 



raaTTrrAGTCAACAAGAGAAGCTGCAGCAATGGGACCTACTAAGTTTACACAAACTAAT 

a^gSSa^^ 

^^p??^T^qq^^ggaCATGAGGAGAAAGGCATAGAGGAACTGAGTGATCCCTCTGGG 

^^^aSg^ataacagagaaacactatgc 

T^aTTrCCTATTTGTAGTCCACTCTCAGGTGCTAATGAATATATTGCAAGCACAGACACT 

rrAAC^CTT^TTCCAGAGAGACTCTGAGACTAAGGGTGAAAGTGGTTTAGTGCTAGAA 
CC ^ CTTCTTT _^^^p nTTTTTGAGGAC CTTGATAAAAAATTAGCACTAGCCTCC 

ag^S^cagS 
ga?gc™acISg^^ 

GATGCTTTA J™™^^ arrTAAC GTATTTTAG caggtggagtgaggttgaagattcc 
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GAACGACTTGGTGCTGGAGTTGCTGGTGTTGAAGATATCAAATCTCATCCATTTTTTACC 
CCTGTGGATTGGGCAGAACTGATGAGATGA 

SEQ ID NO: 16_AA626690_H 

ATGCTACCATTCGCTCCTCAGGACGAGCCCTGGGACCGAGAAATGGAAGTGTTCAGCGGC 

GGCGGCGCGAGCAGCGGCGAGGTAAATGGTCTTAAAATGGTTGATGAGCCAATGGAAGAG 

GGAGAAGCAGATTCTTGTCATGATGAAGGAGTTGTTAAAGAAATCCCTATTACTCATCAT 

GTTAAGGAAGGCTATGAGAAAGCAGATCCTGCACAGTTTGAGTTGCTCAAGGTTCTTGGT 

CAGGGGTCATTTGGAAAGGTTTTTCTTGTTAGAAAGAAGACCGGTCCTGATGCTGGGCAG 

CTCTATGCAATGAAGGTGTTAAAAAAAGCCTCTTTAAAAGTTCGAGACAGAGTTCGGACA 

AAGATGGAGAGGGATATACTGGTGGAAGTAAATCATCCATTTATTGTCAAATTGCACTAT 

GCCTTTCAGACTGAAGGGAAACTGTACTTAATACTGGATTTTCTCAGGGGAGGAGATGTT 

TTCACAAGATTATCCAAAGAGGTTCTGTTTACAGAGGAAGATGTGAAATTCTACCTCGCA 

GAACTGGCCCTTGCTTTGGATCATCTGCACCAATTAGGAATTGTTTATAGAGACCTGAAG 

CCAGAAAACATTTTGCTTGATGAAATAGGACATATCAAATTAACAGATTTTGGACTCAGC 

AAGGAGTCAGTAGATCAAGAAAAGAAGGCTTACTCATTTTGTGGTACAGTAGAGTATATG 

GCTCCTGAAGTAGTAAATAGGAGAGGCCATTCCCAGAGTGCTGATTGGTGGTCATATGGT 

GTTCTTATGTTTGAAATGCTTACTGGTACTCTGCCATTTCAAGGTAAAGACAGAAATGAG 

ACCATGAATATGATATTAAAAGCAAAACTTGGAATGCCTCAATTTCTTAGTGCTGAAGCA 

CAAAGTCTTCTAAGGATGTTATTCAAAAGGAATCCAGCAAATAGATTGGGATCAGAAGGA 

GTTGAAGAAATCAAAAGACATCTGTTTTTTGCAAATATTGACTGGGATAAATTATATAAA 

AGAGAAGTTCAACCTCCTTTCAAACCTGCTTCTGGAAAACCAGATGATACTTTTTGTTTT 

GATCCTGAATTTACTGCAAAAACACCTAAAGATTCTCCCGGTTTGCCAGCCAGTGCAAAT 

GCTCATCAGCTCTTCAAAGGATTCAGCTTTGTTGCAACTTCTATTGCAGAAGAATATAAA 

ATCACTCCTATCACAAGTGCAAATGTATTACCAATTGTTCAGATAAATGGAAATGCTGCA 

CAATTTGGTGAAGTATATGAATTGAAGGAGGATATTGGTGTTGGCTCCTACTCTGTTTGC 

AAGCGATGCATACATGCAACTACCAACATGGAATTTGCAGTGAAGATCATTGACAAAAGT 

AAGCGAGACCCTTCAGAAGAGATTGAAATATTGATGCGCTATGGACAACATCCCAACATT 

ATTACTTTGAAGGATGTCTTTGATGATGGTAGATATGTTTACCTTGTTACGGATTTAATG 

AAAGGAGGAGAGTTACTTGACCGTATTCTCAAACAAAAATGTTTCTCGGAACGGGAGGCT 

AGTGATATACTATATGTAATAAGTAAGACAGTTGACTATCTTCATTGTCAAGGAGTTGTT 

CATCGTGATCTTAAACCTAGTAATATTTTATACATGGATGAATCAGCCAGTGCAGATTCA 

ATCAGGATATGTGATTTTGGGTTTGCAAAACAACTTCGAGGAGAAAATGGACTTCTCTTA 

ACTCCATGCTACACTGCAAACTTTGTTGCACCTGAGGTTCTTATGCAACAGGGATATGAT 

GCTGCTTGTGATATCTGGAGTTTAGGAGTCCTTTTTTACACAATGTTGGCTGGCTACACT 

CCATTTGCTAATGGCCCCAATGATACTCCTGAAGAGATACTGCTGCGTATAGGCAATGGA 

AAATTCTCTTTGAGTGGTGGAAACTGGGACAATATTTCAGACGGAGCAAAGGATTTGCTT 

TCCCATATGCTTCATATGGACCCACATCAGCGGTATACTGCTGAACAAATATTAAAGCAC 

TCATGGATAACTCACAGAGACCAGTTGCCAAATGATCAGCCAAAGAGAAATGATGTGTCA 

CATGTTGTTAAGGGAGCAATGGTTGCAACATACTCTGCCCTGACTCACAAGACCTTTCAA 

CCAGTCCTAGAGCCTGTAGCTGCTTCAAGCTTAGCCCAGCGACGGAGCATGAAAAAGCGA 

ACATCAACTGGCCTGTAA 

SEQ ID NO: 17 AA215680_H 

ATGAGCCTGGTGGCCTGTGAGTGCCTGCCCAGCCCCGGCCTGGAGCCTGAGCCTTGCTCA 
CGAGCACGGTCCCAAGCTCACGTGTACCTGGAGCAGATTCGCAACAGGGTGGCTCTGGGA 
GTGCCTGACATGACAAAACGTGACTATCTGGTGGATGCGGCCACGCAGATCCGGCTGGCC 
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CCGCTGAGCAGTGGAGCCAGCCCCAGCGCGGGTTTCAGCAGCCTGAGGCTCCGGCCCATT 

CGCACGCTGAGCTCTGCCGTGGAGCAGCTGAGGGGCTGCAGGGTGGTCGGGGTCATCGAG 

AAGGTGCAGCTGGTCCAGGACCCGGCAACCGGAGGGACCTTTGTGGTGAAGAGCCTACCC 

AGGTGCCACATGGTGAGCAGGGAGCGGCTGACCATCATCCCACACGGAGTCCCCTACATG 

ACGAAGCTGCTCAGGTACTTTGTGAGCGAGGACTCCATCTTCCTGCACCTGGAGCATGTG 

CAAGGAGGCACTCTCTGGTCCCACCTGCTCTCCCAGGCGCACTCCCGACATTCTGGGCTC 

AGCTCTGGCTCTACCCAGGAGAGGATGAAGGCTCAGCTCAACCCCCACCTCAACCTCCTG- 

ACCCCAGCGAGGCTTCCCTCAGGCCATGCCCCTGGCCAGGACAGAATCGCCCTGGAGCCT 

CCTAGGACTTCTCCGAACCTTCTCCTAGCTGGGGAGGCCCCATCCACCAGACCCCAGAGG 

GAGGCTGAAGGTGAACCCACAGCCAGGACCAGCACCTCTGGCTCCTCGGACCTTCCAAAG 

GCCCCAGGTGGCCACCTGCACCTTCAAGCTAGGAGGGCTGGCCAGAACTCAGACGCTGGG 

CCCCCTCGGGGGCTCACTTGGGTTCCTGAGGGGGCCGGCCCGGTGCTAGGGGGCTGTGGC 

CGAGGCATGGATCAGAGCTGCCTGTCAGCAGATGGGGCCGGCCGGGGCTGTGGCAGGGCC 

ACCTGGAGTGTGAGAGAGGAGCAGGTGAAGCAGTGGGCGGCAGAGATGCTGGTAGCGCTG 

GAGGCGCTGCACGAGCAGGGGGTGCTGTGCCGGGACCTCCACCCCGGGAACCTGCTCCTG 

CACCAGGCAGGTCACATCCGGCTCACATATTTTGGCCAGTGGTCAGAGGTGGAGCCCCAG 

TGCTGCGGGGAGGCCGTGGACAATCTCTACAGCGCCCCAGAGGTGGGTGGGATTTCCGAG 

CTGACGGAAGCCTGTGACTGGTGGAGCTTTGGGTCTCTACTGTATGAACTGCTGACGGGA 

ATGGCACTGTCCCAGAGCCACCCTTCAGGAATCCAGGCCCACACCCAGCTCCAGCTGCCC 

GAGTGGCTCAGTCGCCCAGCGGCCTCTCTGCTGACTGAGCTGCTGCAGTTCGAGCCTACC 

CGGCGCCTGGGCATGGGAGAAGGTGGTGTCAGCAAACTCAAGTCCCATCCCTTTTTCAGT 

ACCATCCAATGGAGCAAGCTGGTGGGGTAA 
^EO ID NO- 18 SGK H 

atgacggtgaaaactgaSgctgctaagggcaccctcacttactccaggatgaggggcatg 

CTCGCAATTCTCATCGCTTTCATGAAGCAGAGGAGGATGGGTCTGAACGACTTTATTCAG 

aagattgccaataactcctatgcatgcaaacaccctgaagttcagtccatcttgaagatc 

TCCCAACCTCAGGAGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTCT 
CAGCAAATCAACCTTGGCCCGTCGTCCAATCCTCATGCTAAACCATCTGACTTTCACTTC 
TTGAAAGTGATCGGAAAGGGCAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCAGAA 
rAAGTGTTCTATGCAGTCAAAGTTTTACAGAAGAAAGCAATCCTGAAAAAGAAAGAGGAG 
AAGCATATTATGTCGGAGCGGAATGTTCTGTTGAAGAATGTGAAGCACCCTTTCCTGGTG 
CGCCTTCACTTCTCTTTCCAGACTGCTGACAAATTGTACTTTGTCCTAGACTACATTAAT 
GGTGGAGAGTTGTTCTACCATCTCCAGAGGGAACGCTGCTTCCTGGAACCACGGGCTCGT 
TTCTATGCTGCTGAAATAGCCAGTGCCTTGGGCTACCTGCATTCACTGAACATCGTTTAT 
AGAGACTTAAAACCAGAGAATATTTTGCTAGATTCACAGGGACACATTGTCCTTACTGAT 
TTCGGACTCTGCAAGGAGAACATTGAACACAACAGCACAACATCCACCTTCTGTGGCACG 
CCGGAGTATCTCGCACCTGAGGTGCTTCATAAGCAGCCTTATGACAGGACTGTGGACTGG 
TGGTGCCTGGGAGCTGTCTTGTATGAGATGCTGTATGGCCTGCCGCCTTTTTATAGCCGA 
AACACAGCTGAAATGTACGACAACATTCTGAACAAGCCTCTCCAGCTGAAACCAAATATT 
ACAAATTCCGCAAGACACCTCCTGGAGGGCCTCCTGCAGAAGGACAGGACAAAGCGGCTC 
GGGGCCAAGGATGACTTCATGGAGATTAAGAGTCATGTCTTCTTCTCCTTAATTAACTGG 
GATGATCTCATTAATAAGAAGATTACTCCCCCTTTTAACCCAAATGTGAGTGGGCCCAAC 
GAGCTACGGCACTTTGACCCCGAGTTTACCGAAGAGCCTGTCCCCAACTCCATTGGCAAG 
TCCCCTGACAGCGTCCTCGTCACAGCCAGCGTCAAGGAAGCTGCCGAGGCTTTCCTAGGC 

TTTTCCTATGCGCCTCCCACGGACTCTTTCCTCTGA 

rrCGTCGACCCACGC^TCCGCCGGTTTCACTGCTCCCCTCA 
CGGGCATCGGGACGATGACCGTCAAAGCCGAGGCTGCTCGAAGC^ 
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WO 00/73469 

FIGURE 2N 

GAATGAGGGGAATGGTAGCGATTCTCATCG^CTTT^ 
ACGATTTTATTCAGAAGATTGCCAGCAACACCTATGCATGCA^ 

CCATTTTGAAAATGTCCCATCCTCAGGAGC^CGGAGCTT^ 

CGCCAAGTCCCTCTCAACAAATCAACCT^ 
CCGACTTTCACTTCTTGAAAGTGATCGGA^G^^ 

TCGTCCTCACTGACNTATTTCAGCTGCGTAGAATCGAG^ 

CCTTCCTCGGCTTCTCCTATGC^ 
GGTTCTGAAGGACTTCCTCAGCGT^CCT^ 

CCAGCTGACAGAAC^ 
CCGGCGTGGCGCGACGCAGCGCGCGCT^^^ 
CTCTCAATGAGCTTGTGAGGTCTTCTTTTCTTC^ 
AGGCGAGCGAGCGTGAGAGTGCCGCCTGAGACAGACAC^ 

ATGCAGGTCTAAGAGGAATCCCCGCAGGTCTGT 
ATGTGCCTTTTCTGAGATCGTGTTAGCTCCAAAGCTTTTT 

TTGTGTTTGTTTGTTTTTGTTTTGTTTTGTTTTTCCCTI CATC AATGTGACAC 

gtggcgtgagtgtgctatgcctgatcacaga^ 
ttgcaggacactacaatgtggg™ 

tgtgtagactgttttgtaagatatagttaa™cia^ ATTTCT atttttagaa 

TGACGAGCATTCAGATGCTTAAG^ 
AGGGTTTTTATGGACCA^TGCCCCA^ 

CAATTTTGGTTTGCAATAAAATCTTGAAAACT 

SE Q ID NO: 20 -^^^°^ CCTGGAGCC CCGGGCCAGGTTCTACGCTGCTGAGGT 
CCACCTGCAGCGGGAGCGCCGGTTCCTGG^ 

GGCCAGCGCCATTGGCTACCTGCACTCCCJ^ 
GAACATTCTCTTGGACTGCCAGGGA^ 

W»3 
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WO 00/73469 

FIGURE 20 



catcttgc^ttcctaga^^ 

AgWtACCTTCAGCTGCTAGG^ 



GACAGGTCATCAGATACTCAGAGGCTGlAl^* AGCTGCCT ATTTTGTGTTTAG 

SgSatcaatatctcttattgttct 

SEQ ID NO: 21_AA887783 H TGCAAAGAGA TCACACCATGGACT 

cSSSttgtgggacagcagagtmg^ 

CATATTCTGTGTGTGTATCTTCTGACTATTCTATAGT^ 

SStgcattcgttggtttctc^ 
tttcccattcagaaaccattgagcaaaat^ 

gtgtgaatatattcaaatatgta^ 

^I^rrCATGGTRTGATCG^TOTA^AGCATOATTAR GGGCGCGGT 



PCTAJS00/14842 
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FIGURE 2P 

cSgggcaacaagagtgaaactccatctcc 

SEQ ID NO: 22_R47805 H GAGCTGAAGG AATTCTTTGCCAAGGCA 
ATGGCGCACCAAACGGGCATCCACGCCACGG^^^ 

CGGGCTGGCTCTGTGCGGCTCATCA^ 

GCCTCGCAGGAGCCAGTAGGCCGCTGGGAT^ 

CTGGACGCCCAG(^GCCCTGCTACCTGCTCTAC^^^ 

Sggtgaagacagagatcagtgtggaaa^^ 

CCCCTGCAGCCTOAGGCC«GCGGGCRCT^GCMC^ GCTGGTG «CACAGAGCCCACG 
ATCCAGATGRAGCTGGACCTRGAGCGGG^CCRTT TACCACTTC TTCCTC 
GATGTGGCCCAGCTGCCCTCCCGGGTGCCCC^^ 



GGCCCGGGTGAAAATGGGGATGACAGCTAG 
fcAcSGGcBcS 

TGTCAAAGCTGAGAATGAAGCGGAGAGCATC^ 

GGGCCAAGGCTCTAGGAAGTGGGATTTCTGGA^TAA AGTGTTTGGCGAG GA 
TCCTTGGTCCCCGTCTGGGCAACTCACCGGTGCCAAGC^ 

AAGATGGCACGGATGACITCTATCAG^ 

AAGGCATAGAGAGCCAGGAAGAGCGGCAGGGCAAG^ 

TGCTGTCTCTCCTGCACACGCAGGATGGCGTGGTGCACC^ 

nCACCTGTGAAATCGTTGAGGACAC^ 
CATCTGCCTCGTCCTGGACTG^ 



TAATCTTCU , . " ~ rTrrTCA ACAAGAGGACACA'l 

ACAGcScCCGCAGGAGCTCTTC^ 
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FIGURE 2Q 

AGTCCCTGTCATCTCTGAGTGGGCCTTTGCAAGTGGTTCCTGACATTGATGACCAAATGA 
rr*ATrrGGATAGCTCCCAGGAGGCGAAGGTGACGGAGGAGTGCTCCCAGTACGAGTTTG 

ag^SaS?gcg?c^ 

gga^Sggctacccaagcggcagttcggcagcgcaccaccggtgcg^ 
acgSSgSca^ctccttggacacggccatcctggcgcag 
a^cSotcagccggggccaccagcactgctgccacttcttccagccccagccaaaggc^ 
tcgctgtcagggctgggccctgtagtgctggactctcccgggccacaatagggacagggc 

IgggISSgacagc^ 
tggStga??Sctcgattg™^ 

aIga^ggaggagggagcctgctcaccccactccagggccttccccagcggccaccaa 
a^SggLcgctgctccccacagtccaaataagctgaaagtgcagctcgctgcaggcc^ 
agagcgagcttcccctcctccctgctctcccaggcccctgccacagcctctttccgtccc 
tctctttctgatccaggcccctcagtccaagctttggaaaaccttcacctcatcttaaac 

CAAACTCAAATATATTTATTTTTTTACCAT 
GCCGC^TGGCCAGCACCAGGAGTATCGA 

ccgcggccggggtcgcggagaggggcccccagctcctccgggggcagcagcagctcgggc 
cccSggggaacgggctcatccccagtccggcgcacagtgcccactgcagcttcta^ 

ArrrGGACCCTGCAGGCCCTCAGCTCGGAGAAGAAGGCCAAGAAGGCGCGCTTCTACCGG 

a^2gggga?cg™?^ 

rGTGTCCGCACTATCTACACCATCGACGGCAGCCGGAAGGTCACCAGCCTGGACGAGCTG 

SggSggtcagagttac^ 
Saaa^a?^?ccaSgtggtctgtgaacatcaaggg 

gc^^ctCctctgtgaaaagtgaagtaaaagaaagtaaagatttcatcaaacccaagtta 
gtcaSgtSScgaagtggagtgaagcct^^ 

aIgaSgctcattcctttgaacaagtcttaacagatatcaccgaagccatta^ 

^agSotcgt?aaLggc^ 

catctg^cagacttttttggtgatgacgatgt^ 

cgSSgccca^gatLctttgtcctggatcatagtg 

Igccaga??tctgctcatggcagatcttcttccaa^ 

Sag^Igg^ctggaaaggagtttgccctaaagattatagacaaagc 

gga^Saacacctgattgagaatgaagtgtcaatactgcgccg^ 
a?c^?atcc?ggtcgaggagatggaaacagcaactgagctctttctggtgatgg 

PTrAAAGGTGGAGATCTCTTTGATGCAATTACTTCGTCGACCAAGTACACTGAGAGAGAT 

ggca^gccSgg?^ 

g?gcacagLacatcaaaccagagaatctcttggtg^^ 

?c?Sgaaactgggagactttgggcttgcgactgtggtagaaggccctttatacacaot 
g?ggacaStgScLctggtgtgatcac 

cgSgtgLaacaatctccaggaagatctcttcgaccagatcttggctcksg 



PCT/US00/14842 

WO 00/73469 

FIGURE 2R 

GTGAGTGGAAGGCGGCAGGTCTCGCCT 
GGTAGCCGGGAGCTGCCCTCA^^^ 

AGATTGGCTTGGCATGTGGAGGGCACTCATT^ TCCAGGC GTCAAGGAGCTC 
GGAGGGGCTTGTGTAGGGACCAGC^^ 

CTGGCTGGGCCCTCTGGGCAGCTGCTTCCACTCTT CCTGCC TACCCCAAGGCC 

CTCCCAAGCCCTGGAGGGGTGTGTTGTGTT^ 

TCAGAAATAGATTATTAGAGATGTGAATTATTC^ 

AAGCTAAACATATTTCAGTTTTAAA^^ 
TAAAGGTACATAATCAAGGAAAAAAATATATATT^ 

TGAGATGTCAGTGACAACGATGGCCTTATTTTi^ ^ TG ^ GTGAC ATGAG 
TCTTAAGGCAACTCTCCTAAATACATAAACA^ 



GCTGTCCAGGCCTGAGCCAATGCAACCCTGGGC^ 
AGAAGCCAGCCAGCCACCCAGGCAGGGACCTTGGTTC^ 

TOTCATGGTACATGGTTGTCTGA^GCCTTACCTOOQflA CCCCACAC TCC 
GCT^GGGCAGATCTTCTTCCAACGTAAACGGTGGGCC^ 

^CTGGGACAACATTACAGACTCTCCTTGTGTGT^ 

cgcSggacgggggcatggtg^^ 

^GCAGGCCCC^TG^ 



ACCAACCTGGCAGA 1 
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FIGURE 2S 

CACAAGCTCCCCCC^^^ 

AGTCCCCCATTTAGTCTGAAGCTGTCC^ 

AAGCTCCTGACTGAGAAGGTCAAGTTGCAGAGTCGGGCT^ 

GGGCTCCCACTGTCAGCAGGGAAGGAGCTGGTAACTC 

gaggatc^gttcaaggacotccctatccag™ 

GGCAATGAAGCCCACCTGAGGAGT^ 

ggaaggaaggctaaga^ggagaca^ 

TGTATGTCTGGAGGCAGAAGGATGACTCTCAGAGATC 

GAGCCCAAGACGAGGCCAGAAGAGAAC^ 

CCCATGGGCATCATTGCCGCCAATGTGG^ 

GATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCG^ 

ATGAAGATCATTGACAAGTCCAGACTCAAG^^ 

ttgatcatccagagcctctctca™ 

GACATGGAAATCTACCTGATCCTGGAGTACGTGCA^ 

ATAGAAAGTGTGAAGTTCCCGGAGCCCG^ 

GCCCTCGTCCACATGCACGACAAGAGCAT^ 

TTGGTTCAGCC^TGM^ 
AAGCATGTGGTGAGACCTATATTTACTGTGTGT^ 

ATTCTTTCTGAGAAAGGTTATGGACTGGA^ 

CAATGTTAAAT G TCACA>CA^Tm^T^TAW^ TOTCTTTT(3 
CTTT^AACCAAGT^ATTCTGTACTACAGGAGTGGTGCTTACCAGGGTCTAAACTCCCC 

CTGTGAGATTAATAAGGTGCATTG 

SEQ ID NO: 28 -^i^^^ CCGCCGC cGCCGCCAGCGACCCCCGCCCCGCCGGCA 
ATGCCAACCGCGCCGGTCCTGCGCCCGCCGCCGC^ 

CCCAGTCGCCCTGCGCCTCCCATTCCGG^ 

TGCTTAAGCTCGAAGATCTCTGAGAGAAAGC^ 

GGACCTCTGGAGAAGCCAGTTCTGGGGCCACGTC 

CAGAGCAGCCTCCACTCAGTCCGCGC^ 

GTGGTGAAGCTGGGTGGGCAGCCCCTCCGTAA^ TGGGC TTCCCACGCTGG 

gtgcagacctttgagcagct™^ 

AAGAACGACCGTGTGCGGAAGCTGTTCACCC^ 

gacttcttccgggagggtgatgctttcatagc^ 
caSgS 



PCT/US00/14842 

WO 00/73469 

FIGURE 2T 

AGTTATCCTCAGGGGGCACCCAAGGCCCAGAAGGACTTC^ 
GAGGAGGGGCCGATAGACATGAGGAGAGAGGACCGGCACACA 

GAAATTCTTTCTGAGAAAGGTTACGGCCTG^ 
CTATACATCCTCTTGTGTGGCTTCCCCCCTTTCCGAAGT^ 
CTCTTCAACATCATCCAAGTGGGCCAGTTTGAGTTCCTCT 
TCTGATGCTGCCAAAGATCTGGTGAGAAATTTGCTGG^AGGT 

GTTGCAGAGCAGATGCCATAA 

GGGGGCAGTCCCGGGAGAACCTGCGGCGGCCGGAGC^GGT 

CAGACCTGGGAATCACCTAACATGTCGAGG^GATT^TT ^^tata 

CTACTAACTACAACTCCTCAAATTCCAATAA^ 

CTTACATCTAAAGAGCTAGGGAGAG^ 

CGGGCAGAAATTTTACACGAGATTGCTGTGCll^ TTTTGATATTGG AATATGCT 
ATTAATCTTCATGAGGTC^ 

GCAGGTGGAGAAATTTTCAGCCTGTGTTTACL ifc* ATT ATCTA CATCAGAATAAC 

GATGTTATCAGACTCATTAAACA^TACTTGA^GGAG^ 

ATTGTACACCTTGATTTAAAGCCACAGAATAT 

C^CATTAAAATAGTAGATTTTGG^ 

GAAATCATGGGAACACCAGAATATTTA^ 

ACAGC^CAGATATGTGGAATATTGGTA^^ 



PCT/US00/14842 
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FIGURE 2U 

TTAGTAAAAAATCCAGAGAAAAGACCAACA^CAGA^ 

CAGCAGTGGGACTTTGAAAACTTGTTTCACCC^TGAAGAAA^TTCC jqtAATGG AAC C 

CAGGATCATTCTGTAAGGTCCTCT^ 
TGTGGTGATAGAGAAGACAAAGAGAATATCC^ 

TTTCGTTTCGATGACTCATTACCCAAT^^ 

CACTTTTTTCTTTGACTCATTTGGACTG^T^ 

TATGATTTGTAGCTTCATATATGACATGTTTAT^^ 

SagTS 

GTGAAAG 

SEQ ID NO: 3°- W ^^°-1?^tCAACTGTAAGAAGCGTCGGTCAGCGTCTGTGCG 

ccagacgcggctgcacttttcaaacc^ 

gtcgccgccgggagtcgcctcacagg^^ 

gagtgcgaggtaaaagtctgcctagagaag^ 

ggagattcgattgccgaagtgtctcaggct^^ 

aaacagagaattttaataatttcta^^ 

ttgctgtggttagacaatgtatatc^ 

tgaaaaagaggagaagagggcagg^ 

tggagctggccaggtcttgtccccacgtgattaatci^ TTTTCAACCTGT gtttac 

Lgaaatcattttggtgttagaatatgctgc^ 

ctgagttagccgaaatggtatctgaaaatgatg™^ 

AAGGAGTTCATTATCTACATCAGAATAACA^ 
TATAAATGCACTTCTGCTTAGAAGAACTTAA^^ 

GCTAGCATATCATTTCTTGTCCTGAAATTGTTTTGCA ^ctcaagTTCAGTCA 

CAAAAAATGTAAATTGTGTTTAAGAGAA^ 

GACTTATAAAATGGGTTATATTATGG^ 

TTAGTAGGTTCTAAGGTAAGCCCTATACCATAALiu c AGAGAG AGTTTA 

AATGCTGTCAAGGGTAAACCACAACATA^CTGOT 

TAGTTGAAAGTATTTCCCAGTTACCAATAA^^ 

GCCATGTGCTCGGTGAGAGGACACAGTCAA^ 

TGCAAACCGAGTCAAAACTCGACATCATT^ 
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FIGURE 2V 

gSSStacagcctttaaacaacttc 

SPO ID NO: 31 H01248_H. DRA ^-" rTrcTCCCC AGGCGCCACCTCAGGCTCG 

Sgatgttcaaagacttatgcgaca^ 

tcacSttcttaggcaatgataaacaa^ 
IStattctgaggaagaatttgatgttttgtctg^ 

rGCGCCGGGGTCCCCAGCCGCCCCOTCTG^^^^^^^^^^^Q^Qj^'TCGACCG 

SSgccccggcctcccgcggacccatgcccgcccgta GCRCCTCGTC accaaggc 

S^CGGCAACMGCAACTTCGCGGTCG^GCGGGC cmGAAGAR 
^^TTGCTATCAAGATCATAGATAAGACCCAGCTGGAl A TCAGGCTCTACCA 

Iotgaactctttgaaggaaaagaatatga^ 
Sgcgggcccgcgtgctgagtggaaagttccg^ 



WO 00/73469 
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FIGURE 2W 

TGCCTATGATCACTATAGTGCAATCTACAGCCTGCTGTGTGATCGACATAAGAGACATAA 

AACCCTGCGTCTCGGAGCACTTCCTAGCATGCCCCGAGCCCTGGCCTTTCAAGCACCAGT 

CAATATCCAGGCGGAGCAGGCAGGTACTGCTATGAACATCAGCGTTCCCCAGGTGCAGCT 

GATCAACCCAGAGAACCAAATTGTGGAGCCGGATGGGACACTGAATTTGGACAGTGATGA 

GGGTGAAGAGCCTTCCCCTGAAGCATTGGTGCGCTATTTGTCAATGAGGAGGCACACAGT 

GGGTGTGGCTGACCCACGCACGGAAGTTATGGAAGATCTGCAGAAGCTCCTACCTGGCTT 

TCCTGGAGTCAACCCCCAGGCTCCATTCCTGCAGGTGGCCCCTAATGTGAACTTCATGCA 

CAACCTGTTGCCTATGCAAAACTTGCAACCAACCGGGCAACTTGAGTACAAGGAGCAGTC 

TCTCCTACAGCCGCCCACGCTACAGCTGTTGAATGGAATGGGCCCCCTTGGCCGGAGGGC 

ATCAGATGGAGGAGCCAACATCCAACTGCATGCCCAGCAGCTGCTGAAGCGCCCACGGGG 

ACCCTCTCCGCTTGTCACCATGACACCAGCAGTGCCAGCAGTTACCCCTGTGGACGAGGA 

GAGCTCAGACGGGGAGCCAGACCAGGAAGCTGTGCAGAGGTACTTGGCAAATAGGTCCAA 

AAGACATACACTGGCCATGACCAACCCTACAGCTGAGATCCCACCGGACCTACAACGGCA 

GCTAGGACAGCAGCCTTTCCGTTCCCGGGTCTGGCCTCCTCACCTGGTACCTGATCAGCA 

TCGCTCTACCTACAAGGACTCCAACACTCTGCACCTCCCTACGGAGCGTTTCTCCCCTGT 

GCGCCGGTTCTCAGATGGGGCTGCGAGCATCCAGGCCTTCAAAGCTCACCTGGAAAAAAT 

GGGCAACAACAGCAGCATCAAACAGCTGCAGCAGGAGTGTGAGCAGCTGCAGAAGATGTA 

CGGGGGGCAGATTGATGAAAGAACCCTGGAGAAGACCCAGCAGCAGCATATGTTATACCA 

GCAGGAGCAGCACCATCAAATTCTCCAGCAACAAATTCAAGACTCTATCTGTCCTCCTCA 

GCCATCTCCACCTCTTCAGGCTGCATGTGAAAATCAGCCAGCCCTCCTTACCGATCAGCT 

CCAGAGGTTAAGGATTCAGCCTTCAAGCCCACCCCCCAACCACCCCAACAACCATCTCTT 

CAGGCAGCCCAGTAATAGTCCTCCCCCCATGAGCAGTGCCATGATCCAGCCTCACGGGGC 

TGCATCTTCTTCCCAGTTTCAAGGCTTACCTTCCCGCAGTGCAATCTTTCAGCAGCAACC 

TGAGAACTGTTCCTCTCCTCCCAACGTGGCACTAACCTGCTTGGGTATGCAGCAGCCTGC 

TCAGTCACAGCAGGTCACCATCCAAGTCCAAGAGCCTGTTGACATGCTCAGCAACATGCC 

AGGCACAGCTGCAGGCTCCAGTGGGCGCGGCATCTCCATCAGCCCCAGTGCTGGTCAGAT 

GCAGATGCAGCACCGTACCAACCTGATGGCCACCCTCAGCTATGGGCACCGTCCCTTGTC 

CAAGCAGCTGAGTGCTGACAGTGCAGAGGCTCACAGCTTGAACGTGAATCGGTTCTCCCC 

TGCTAACTACGACCAGGCGCATTTACACCCCCATCTGTTTTCGGACCAGTCCCGGGGTTC 

CCCCAGCAGCTACAGCCCTTCAACAGGAGTGGGGTTCTCTCCAACCCAAGCCCTGAAAGT 

CCCTCCACTTGACCAATTCCCCACCTTCCCTCCCAGTGCACATCAGCAGCCGCCACACTA 

TACCACGTCGGCACTACAGCAGGCCCTGCTGTCTCCCACGCCGCCAGACTATACAAGACA 

CCAGCAGGTACCCCACATCCTTCAAGGACTGCTTTCTCCCCGGCATTCGCTCACCGGCCA 

CTCGGACATCCGGCTGCCCCCAACAGAGTTTGCACAGCTCATTAAAAGGCAGCAGCAACA 

ACGGCAGCAGCAGCAGCAACAGCAGCAACAGCAAGAATACCAGGAACTGTTCAGGCACAT 

GAACCAAGGGGATGCGGGGAGTCTGGCTCCCAGCCTTGGGGGACAGAGCATGACAGAGCG 

CCAGGCTTTATCTTATCAAAATGCTGACTCTTATCACCATCACACCAGCCCCCAGCATCT 

GCTACAAATCAGGGCACAAGAATGTGTCTCACAGGCTTCCTCACCCACCCCGCCCCACGG 

GTATGCTCACCAGCCGGCACTGATGCATTCAGAGAGCATGGAGGAGGACTGCTCGTGTGA 

GGGGGCCAAGGATGGCTTCCAAGACAGTAAGAGTTCAAGTACATTGACCAAAGGTTGCCA 

TGACAGCCCTCTGCTCTTGAGTACCGGTGGACCTGGGGACCCTGAATCTTTGCTAGGAAC 

TGTGAGTCATGCCCAAGAATTGGGGATACATCCCTATGGTCATCAGCCAACTGCTGCATT 

CAGTAAAAATAAGGTGCCCAGCAGAGAGCCTGTCATAGGGAACTGCATGGATAGAAGTTC 

TCCAGGACAAGCAGTGGAGCTGCCGGATCACAATGGGCTCGGGTACCCAGCACGCCCCTC 

CGTCCATGAGCACCACAGGCCCCGGGCCCTCCAGAGACACCACACGATCCAGAACAGCGA 

CGATGCTTATGTACAGCTGGATAACTTGCCAGGAATGAGTCTCGTGGCTGGGAAAGCACT 

TAGCTCTGCCCGGATGTCGGATGCAGTTCTCAGTCAGTCTTCGCTCATGGGCAGCCAGCA 

GTTTCAGGATGGGGAAAATGAGGAATGTGGGGCAAGCCTGGGAGGTCATGAGCACCCAGA 

CCTGAGTGATGGCAGCCAGCATTTAAACTCCTCTTGCTATCCATCTACGTGTATTACAGA 

CATTCTGCTCAGCTACAAGCACCCCGAAGTCTCCTTCAGCATGGAGCAGGCAGGCGTGTA 



WO 00/73469 
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FIGURE 2X 

ACAAGAAACAGAGAGTTTTGTGTACAGCTTGGGAATGAAAAGGTTGATTGTAAACCCACA 
GTATCTAGCAGCGTTGTGCCAAATTGCCCTTGTGTTTCTCTCCACCCAAAATATCACAGC 
TGCTTTCCTCACATTTGGTTCATCCGTGTGCTGTTCTTTTGGGTTCTGAGAGGGTTTTGC 
CATGTTTGCTTGTATGACCAAGTCACCAAGGAAATAAACAGGAAGGAAATCCATGTTCTC 



SEQ ID NO: 33_2R22-5-ll_H 

CTGGGCCGCTGCCGGTCAGGTCGGCCGCCCCTGACAGCTCCGGGAGCCTCAAGCGCGACA 

GGGCGCCCTCACCTCGGGACATCCACACACCGACCGCTCCTGCTCCAGAGGCAACAACCC 

AGCGCGCCTAGCCTGGCGCCGTGCAGCGAAGCCCAAGAGCTGGCCTCGCCACGAAGGTTG 

AACCAGCCAAATTTTCGAGACAGCTCACGGCTTAGAGGAAGGTTCATCTAAATAAAGGCC 

GGCTAAAGTGACATTGCAGGGATTAAATCCTTCTTTGGCTGCCTGTGTGACCAGAAGGCT 

TATTTGCAAGTTTCTTCTTTCCTGGGGTCCAGATTATTAGGTCTCCAGCGCCCTGCAGCT 

TGACAGAAAGAGAAGCATGAAATGAAGGTCAGAGATGAGATCCCGCAGCAGGGACGTGGG 

GGCCTCCCAGGGGCATTTACGCACCAGAGTGCAAGATTCTCTGGCCATCAAGGGAAATAG 

CAAACAGAAGCCTTTGTCCTGGGGCACAGCCACCTACCACAAAGCATCAGACTCCACGTC 

TGGCCAGAAAGTTCCTGGAGTCCCATCAGGCCAGTGGGTATGTAACATGTGCCTAATTGT 

ACAGCTAGAGCCTGCAAGTTCAACGTGAGGGAAGGTGGGAAATGTCTTGAGTGAGGCGAG 

CAGCTCCTGGCTGGGCTGGGCAGACTCAGCTACCACGTTCACTGCCTTCCTCTCACTAAA 

GCCGAGAGGGAGGCTGCTCAGCTCTCAGGAAAACTCTTTTGAACCCTGGGCACCTGCTGT 

CCTCAGTTGGCATCTCCCACCCTCTGAGCCTCTTCTGCTCCTGCACAACCTGCCTCTTCG 

CTGAGATGGAGACGTGAGCCCCCGTGGACGATGACTGCAGTGTATATGAATGGAGGTGGC 

CTGGTGAACCCCCACTATGCCCGGTGGGATCGGCGCGACAGTGTAGAAAGTGGCTGTCAG 

ACCGAGAGTAGCAAGGAGGGTGAGGAGGGACAGCCCCGCCAGCTGACGCCCTTCGAGAAA 

CTGACACAGGACATGTCCCAGGATGAGAAGGTGGTGAGGGAGATCACGCTGGGGAAACGG 

ATAGGCTTCTACCGAATTCGAGGGGAAATCGGAAGTGGAAACTTCTCCCAAGTGAAGCTT 

GGGATTCACTCCCTAACCAAAGAAAAGGTGGCCATTAAGATCCTGGACAAGACCAAGTTA 

GACCAGAAAACCCAGAGGCTACTATCCCGAGAAATCTCCAGCATGGAAAAGCTGCACCAT 

CCCAACATCATCCGCCTTTACGAAGTGGTGGAGACCCTATCCAAGCTGCACTTGGTGATG 

GAGTATGCAGGGGGTGGGGAGCTCTTCGGAAAAATTAGCACTGAGGGGAAGCTCTCTGAA 

CCAGAAAGCAAGCTCATCTTCTCCCAGATTGTGTCTGCCGTGAAGCACATGCATGAAAAC 

CAAATTATTCATAGAGATCTGAAAGCAGAAAATGTATTCTATACCAGTAATACTTGTGTG 

AAGGTGGGCGATTTTGGATTCAGCACAGTAAGCAAAAAAGGTGAAATGCTGAACACTTTC 

TGTGGGTCTCCTCCCTACGCTGCGCCTGAACTCTTCCGGGACGAGCACTACATCGGCATT 

TACGTGGATATCTGGGCCTTGGGGGTGCTTTTGTACTTCATGGTGACTGGCACCATGCCA 

TTTCGGGCAGAAACCGTGGCCAAACTAAAAAAGAGCATCCTCGAGGGCACATACAGTGTA 

CCGCCGCACGTGTCAGAGCCCTGCCACCGACTCATCCGAGGAGTCCTTCAGCAGATCCCC 

ACGGAGAGGTACGGAATCGACTGCATCATGAATGATGAATGGATGCAAGGGGTGCCATAC 

CCTACACCTTTGGAACCTTTCCAACTGGATCCCAAACATTTGTCGGAAACCAGCACTCTC 

AAGGAAGAAGAAAATGAGGTCAAAAGCACTTTAGAACATTTGGGCATTACAGAAGAGCAT 

ATTCGAAATAACCAAGGGAGAGATGCTCGCAGCTCAATCACAGGGGTCTATAGAATTATT 

TTACATAGAGTCCAAAGGAAGAAGGCTTTGGAAAGTGTCCCAGTCATGATGCTACCAGAC 

CCTAAAGAAAGAGACCTCAAAAAAGGGTCCCGTGTCTACAGAGGGATAAGACACACATCC 

AAATTTTGCTCGATTTTATAAATTGCACTAGACTGCTTGTAACTAACCAAGATGATTGTT 

GCTGCTTCTAAATTTTTTTCAAGGACAACTTGAGTGGAGACATTTTTGTAATTTTTAAAT 

AAACTTAAATTTGAGATATGCAAAAAAAAAA 
SEQ ID NO: 34 R31237_1_H, AAC334 87 

ATGTCCACTAGGACCCCATTGCCAACGGTGAATGAACGAGACACTGAAAACCACACGTCA 
CATGGAGATGGGCGTCAAGAAGTTACCTCTCGTACCAGCCGCTCAGGAGCTCGGTGTAGA 
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FIGURE 2Y 

AACTCTATAGCCTCCTGTGCAGATGAACAACCTCACATCGGAAACTACAGACTGTTGAAA 

ACAATCGGCAAGGGGAATTTTGCAAAAGTAAAATTGGCAAGACATATCCTTACAGGCAGA 

GAGGTTGCAATAAAAATAATTGACAAAACTCAGTTGAATCCAACAAGTCTACAAAAGCTC 

TTCAGAGAAGTAAGAATAATGAAGATTTTAAATCATCCCAATATAGTGAAGTTATTCGAA 

GTCATTGAAACTGAAAAAACACTCTACCTAATCATGGAATATGCAAGTGGAGGTGAAGTA 

TTTGACTATTTGGTTGCACATGGCAGGATGAAGGAAAAAGAAGCAAGATCTAAATTTAGA 

CAGATTGTGTCTGCAGTTCAATACTGCCATCAGAAACGGATCGTACATCGAGACCTCAAG 

GCTGAAAATCTATTGTTAGATGCCGATATGAACATTAAAATAGCAGATTTCGGTTTTAGC 

AATGAATTTACTGTTGGCGGTAAACTCGACACGTTTTGTGGCAGTCCTCCATACGCAGCA 

CCTGAGCTCTTCCAGGGCAAGAAATATGACGGGCCAGAAGTGGATGTGTGGAGTCTGGGG 

GTCATTTTATACACACTAGTCAGTGGCTCACTTCCCTTTGATGGGCAAAACCTAAAGGAA 

CTGAGAGAGAGAGTATTAAGAGGGAAATACAGAATTCCCTTCTACATGTCTACAGACTGT 

GAAAACCTTCTCAAACGTTTCCTGGTGCTAAATCCAATTAAACGCGGCACTCTAGAGCAA 

ATCATGAAGGACAGGTGGATCAATGCAGGGCATGAAGAAGATGAACTCAAACCATTTGTT 

GAACCAGAGCTAGACATCTCAGACCAAAAAAGAATAGATATTATGGTGGGAATGGGATAT 

TCACAAGAAGAAATTCAAGAATCTCTTAGTAAGATGAAATACGATGAAATCACAGCTACA 

TATTTGTTATTGGGGAGAAAATCTTCAGAGCTGGATGCTAGTGATTCCAGTTCTAGCAGC 

AATCTTTCACTTGCTAAGGTTAGGCCGAGCAGTGATCTCAACAACAGTACTGGCCAGTCT 

CCTCACCACAAAGTGCAGAGAAGTGTTTCTTCAAGCCAAAAGCAAAGACGCTACAGTGAC 

CATGCTGGACCAGCTATTCCTTCTGTTGTGGCGTATCCGAAAAGGAGTCAGACAAGCACT 

GCAGATGGTGACCTCAAAGAAGATGGAATTTCCTCCCGGAAATCAAGTGGCAGTGCTGTT 

GGAGGAAAGGGAATTGCTCCAGCCAGTCCCATGCTTGGGAATGCAAGTAATCCTAATAAG 

GCGGATATTCCTGAACGCAAGAAAAGCTCCACTGTCCCTAGTAGTAACACAGCATCTGGT 

GGAATGACACGACGAAATACTTATGTTTGCAGTGAGAGAACTACAGCTGATAGACACTCA 

GTGATTCAGAATGGCAAAGAAAACAGCACTATTCCTGATCAGAGAACTCCAGTTGCTTCA 

ACACACAGTATCAGTAGTGCAGCCACCCCAGATCGAATCCGCTTCCCAAGAGGCACTGCC 

AGTCGTAGCACTTTCCACGGCCAGCCCCGGGAACGGCGAACCGCAACATATAATGGCCCT 

CCTGCCTCTCCCAGCCTGTCCCATGAAGCCACACCATTGTCCCAGACTCGAAGCCGAGGC 

TCCACTAATCTCTTTAGTAAATTAACTTCAAAACTCACAAGGAGTCGCAATGTATCTGCT 

GAGCAAAAAGATGAAAACAAAGAAGCAAAGCCTCGATCCCTACGCTTCACCTGGAGCATG 

AAAACCACTAGTTCAATGGATCCGGGGGACATGATGCGGGAAATCCGCAAAGTGTTGGAC 

GCCAATAACTGCGACTATGAGCAGAGGGAGCGCTTCTTGCTCTTCTGCGTCCACGGAGAT 

GGGCACGCGGAGAACCTCGTGCAGTGGGAAATGGAAGTGTGCAAGCTGCCAAGACTGTCT 

CTGAACGGGGTCCGGTTTAAGCGGATATCGGGGACATCCATAGCCTTCAAAAATATTGCT 

TCCAAAATTGCCAATGAGCTAAAGCTGTAA 
SEO ID NO: 35 W90839_M 

AAAGGGCCGTCCTGGTCCAGCCGTTCCCTGGGTGCCCGTTGCCGGAACTCTATCGCTTCC 
TGCCCTGAGGAACAACCCCATGTGGGCAACTATAGGCTGCTAAGGACCATCGGGAAGGGC 
AACTTCGCCAAAGTCAAGCTGGCTCGGCATATCCTCACGGGCCGGGAGGTCGCTATTAAG 
ATCATTGATAAGACCCAGCTGAACCCCAGTAGCTTGCAGAAGCTGTTCAGAGAAGTCCGA 
ATTATGAAGGGACTCAACCACCCCAACATCGTGAAGCTTTTTGAGGTGATAGAGACGGAG 
AAGACGCTATACCTGGTGATGGAATACGCTAGCGCAGGAGAAGTGTTTGACTACCTCGTG 
TCGCACGGCCGCATGAAGGAGAAGGAGGCTCGAGCCAAGTTCCGGCAGATCGTGTCAGCC 
GTGCACTACTGTCATCAGAAGAACATTGTACACAGGGATCTAAAGGCTGAAAACCTGTTG 
CTGGATGCCGAGGCCAACATCAAAATCGCCGACTTCGGCTTCAGCAATGAGTTCACGCTG 
GGCTCCAAGCTGGACACCTTCTGTGGGAGCCCCCCATACGCCGCCCCAGAGCTGTTCCAG 
GGCAAGAAGTATGATGGGCCAGAGGTGGACATCTGGAGCCTGGGTGTCATCCTGTACACG 
CTGGTCAGCGGCTCCCTGCCCTTCGATGGGCACAACCTCAAGGAGCTGCGGGAGCGAGTC 
CTCAGAGGAAAGTACCGGGTCCCCTTCTACATGTCTACAGACTGCGAGAGCATTCTGCGG 
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FIGURE 2Z 

AGATTTCTGGTGCTGAACCCCGCAAAACGCTGTACTCTGGAGCAAATCATGAAAGACAAA 
TGGATCAACATCGGCTATGAGGGTGAGGAGCTGAAGCCAGACACGGAGCTCAAAGAAGAG 
CGG^TGC^GGGTCGGAAAGCGAGCTGCAGTGCAGTGGGCAGTGGAAGTCGAGGCTTG^ 
CCCTCCAGCCCCATGGTCAGCAGTGCCCACAACCCCAATAAGGCAGAGATCCCTGAGC^ 

cSgaaggacagcactagcacccctaacaacctcccccccagcatgatgacccgaagaaac 

ga^tagctccggtacctcgcgggtgccccctgcctcgccttccagtcatagcctggct 
?c^gtcLgcgagcggagccgcctggctcggggctccaccatccgcagcaccttccat 
gggggccaggtccgagaccggcgggcagggagcgggagtggcgggggtgtgcagaatgga 

ScCAGCCTCACCCACGCTTGCCCACGAG^ 

rPCACCACCAACCTCTTCACCAAGCTGACCTCCAAACTGACCCGAAGGGTCACAGACGAA 

SgaSgaa^ggggggLctgaggtca 

ACCGCCCCCAGGCTGCTCCGATTCCCCTGGAGTGTGAAGCTGACCAGCTCGCGACCTTCC 
TGAGGCCCTGATGGCTGCCCTGCGACAGGCCACA 

rTAGCCGGCTTGGCGTGACCGTCGCCTGATC 

ggc?tc??Sc?tcccatggaggtcgggggcttaacagtctttgaagaggaccag^ 
2ctScccagagcctccccttgccagtgtcagcagagggcccagctgcacagaccactgc 

?Sg?ccScaggtcgttttcctcagcccacaga^ 

cagaSSg?caLgcaggacggcgctctctgaaga 

ScactcgSgcccagaatatttgtac^ 

cIcgS?cgtccgaaccgcggggcagtgtgtcctgctgctccctgct^ • 
SSgSgtggtcctcacctctgcttccggcccctgtgtgcaaccctaacaaggccatctt 

SSg?GGA?GcS 

cgggtacagcagccaggacctgattggccagaagctcacgcagttctttctgaggtcaga 

????gaS?gg?ggaggccctcagcgaggagcacatggaggccgacggccacgc^ 

ggtg?^gg?acggtggtggacatcatcacccgtagtggggagaagattccagtgtctgt 

g?ggSgaagaggatgcggcaggagcgccgcctatgctgcgtggtggtcctggagcccgt 

rGAGAGGOTCTCGACCTGGGTCGCTTTCCAGAGCGATGGCACCATCACGTCATGTGACAG 

IScSgatcccttctgtgcagctccctccttctggccagcacatcccaaagaatctc^ 
^Sgaggtctgttggaagag^ 

gaaItcSaacccagcagcgaggaggcgaccaccggtgaggcggcccctgtgagcgg^ 

cc^gcaotgtctgggtgttct 
gacS?cSSgcLcaaccac^ 

rCTCCTGGGCAAGAATATCACTTTCCTGATTCCTGGTTTCTACAGCTACATGGACCTTGC 

gScS?agc^Lagctcccagacctggccagctgcctgg^ 
tcSgtctggggagagaaccttggacccgtggcagggccaggacccagctgaggggggcca 

gga?ccSggStaatgtcgtgcttgctggtggccacg 

rAACCTGATGGAAAGCCAAGACATCTTCACCGGGACTCAGACTGAGCTGATTGCTGGAGG 

gISaSS^ 

tgaaccagtcgatgtgaagccatttgcttcctgcgaagattctgaagctccagtccca^ 

raTrrrAGTCAGTGGTCCCAGCGGTTCAGACCTTTGGGCTGGGGCTGCCGTGGCCAAGCC 

C ^ AG „„^™^^^^r,nr.AAr,rrAGGACTTGGCCCCCAGCCCCTCTGGGATGGCAGG 

rGAGCCGTGGCTGGGAGTGGAAAACGACCGAGA 



CCTCTCGTTTGGGACACCTACTCTAGATC 
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FIGURE 2AA 

AGAGCTGCAGACCTGCTTGATT^GGAGCA^ 
GGATGTCCCCCACGCCGAACTCGTT^^^ 
GTCCTGCGATCTGGGAGGCAGAGACCTGTC^^ 

CTATGCCTTGGCCACGGACCTC^^^ 

TGTGAATTCGTTTTCCTGGAACCT^ 

GTCATCAAATTGTTCCTGTGCTACGTCTGA^ 

GGGCTCCGATCCAGATGTAGGCAGTC^ 

GGAGCTGTTACTACTGACCGGCACCTG^ 

GAGCTGTGTGGGACATGATCCAACAGAACCG^ 

TTATGCAGCAAGCGACAGAGAAAGCCCAGGACA^ TCCAGGT CACCTCCACGCC 
TGAGGACACGTGCCCATCAGCAGAGGAGCCA^GGCTG^ 

CGTGATCGTGATGCGCGGGGCTGCTGGCCTGCAGC^ GGTGAGGCGGG T 
CGGGAGCTGCTACCATCGAGATGGCTTACGG^ 

GGAGCTCCAGGGCCCCACACCTC^ 
CCAACGCGACTCAGCCGCCAGGACCC^ 

CTCTACCGCTGCTGAGCTCACCGGACCCAGCC1 GGCC TGTGAGGGCGAGTA 

GTTTGAGGAGCCCCCCAAGGCTGTGGAACTG^ 

CTCCCAAAAGTACAGTACCATGAGCCCGCTGGGC^ 

^GCTGTGGACAAGGGAAA^C^ 

CTTGGAGGATTGTTGGATTGAGGATCCCA^CT GATAT ATTTGAAAACCAAGG 

TCTATCCAGGGTGGAGCACGCCAATATCATC^ 

GTTCTTCCAGCTTGTGATGGAGA^GCACGGC^^ 

QMCMQMMCTGCaT^^ 



T 1 



GTCTGGGCTGCTGCATC^ 

CCCGTGGGTAACACAGCCTGTGA^TCTTGCTGAC^ 
AAACAAGCCAGAAAGTGGAGTTC^ 

GAGTGATGTGGCCCAGGCTCAGGAGCTTTGTGGG^ CAGC TAAACACCAATTTC 
_„ m ^^o^ n nRATACCTGCACTCCCAAGGCATCAic^ 



A' 



CATTGTGATTGCT^GGACTrCACMTl^^-— GAATRCTGTGCAC CT G AGG 
TAGAOTGGGGCAAACTATTTTATACOTTOTGGAACAA TCTCTGGGGGT CACCC 
TTGTCATTGGAAATGCCTAGAGA^GG^^ 

TGTACACGCTCATCTTGGAGGAGAATCCCTTCIGi^ CTGTCTGGACTG CT G C 
roTTCATCGGCCATrGCTOG^C^A^TGAGT^^^ 

AGCCTTGCCCTCAGCAGCGOACCACTTTGGMAA TGTAGGACC MCCA G GCA G AAA 
AGCCTCT<3AAC<^CTAGCTATACTOGGAAG^ GTAGGAGTCCAAG TGA^TGGCTC 
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TTCAGTTATACTGTACTATTAAA^GGCTTTAAGTT^ ^^^^^^^^^0^^.^^ 
TGTTTAAGTTCTCTATTTrTGTTGCTTCTTmGCTTGTAA TRGACCTTAAAC TGA 
ACCATGTAACTTTGGCTGGCCTGGAAC^ 

cagatctgcctgcgcttgcctcccaagca™^ 
aS^acttggttttctatttctttttgt 

-CAAGC^GCGaGTrCGCC^C^GGCG^G^^^^^^ 



G 



»™tgaaaatgttagac_cacc™^^^ 



TGG TGAACTGGTGGCAACATGGTGTCGCAGCCCCCCTO 
TCTGGAAGGAAGATTCCGGATTCCGTAnTCATGTCAGMGA^ 

TTATTTCTTGTTGGTGGAGCGCCTGA^ 

CTTTGAGGCATTTCAGTCC^CACG^GC^GCAGAGA ATGAATGAC AGCCCCTC 

ssessses^ 



CT 



Iggaattctagagttgaacaaagtgcagttgttgtatg^ 
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FIGURE 2CC 

GCAGCAGCTGCAGGAACATAGGCTCCAGCAGAAGCGACTCTTTCTTCAGAAGCAGTCTCA 

ACTGCAGGCCTATTTTAATCAGATGCAGATAGCAGAGAGCTCCTACCCACAGCCAAGTCA 

GCAGCTGCCCCTTCCCCGCCAGGAGACTCCACCGCCTTCTCAGCAGGCCCCACCGTTCAG 

CCTGACCCAGCCCCTGAGCCCCGTCCTGGAGCCTTCCTCCGAGCAGATGCAATACAGCCC 

TTTCCTCAGCCAGTACCAAGAGATGCAGCTTCAGCCCCTGCCCTCCACTTCCGGTCCCCG 

rGCTGCTCCTCCTCTGCCCACGCAGCTACAGCAGCAGCAGCCGCCACCGCCACCACCCCC 

TCCACCACCACGACAGCCAGGAGCTGCCCCAGCCCCCTTACAGTTCTCCTATCAGACTTG 

TGAGCTGCCAAGCGCTGCTTCCCCTGCGCCAGACTATCCCACTCCCTGTCAGTATCCTGT 

GGATGGAGCCCAGCAGAGCGACCTAACGGGGCCAGACTGTCCCAGAAGCCCAGGACTGCA 

AGAGGCCCCCTCCAGCTACGACCCACTAGCCCTCTCTGAGCTACCTGGACTCTTTGATTG 

TGAAATGCTAGACGCTGTGGATCCACAACACAACGGGTATGTCCTGGTGAATTAGTCTCA 

GCACAGGAATTGAGGTGGGTCAGGTGAAGGAAGAGTGTATGTTCCTATTTTTATTCCAGC 

rTTTTAAATTTAAAGCTTATTTTCTTGCCCTCTCCCTAACGGGGAGAAATCGAGCCACCC 

AACTGGAATCAGAGGGTCTGGCTGGGGTGGATGTTGCTTCCTCCTGGTTCTGCCCCACCA 

CAAAGTTTTCTGTGGCAAGTGCTGGAACATAGTTGTAGGCTGAGGCAGGAGAATGGCGTG 

AACCCGGGAGGCGGAGCTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGGGCG 

ACTGAGCAAGACTCCACCTCAAAAAAAAAAAAAAAGGACAAGAGCAGTATCATCTGCCTC 

TGTTTCTAAACTGGACAAAGAGATTTTCTTAAAGTTTCTATCATCTCCCTTCTGACAGGT 

TCTACAGTGTGGTCTGAAGCACCTGTAATGTCAGAGCCCTTGTCTGGCCCTTGGTGGCAG 

GTGAACGAAAGCAGTGGAGCCTCTCACCTTCCAGTAGCCTCTCACATTCTTATTTTACCA 

TTTTTGTCCTAATTAAGGTAGCCTAGCTGATTCTAGAAGACAGCCATCCTACGTGCACCC 

CCACCTTGTGTCCACATCTTCTCCAGGCAGGTTTCAACCTATCAGCAGACTCAGGCACAC 

ACTGGGGCACAGATAGAGAACCAGGCGGCAGCAGTGCTCGCAGACCCACCCAGGGAGAGC 

TGTGATGGGTTCTGCCCAGATACTCTGCTCGCCCACCCACAAGGGAGCAATAGCTTATAT 

TTGTACATTAGTTTTACCAAGCACTTTCTCTTCTAACCCTCACAACAATTCTATGAAATT 

AGCTGGGGAGATACTGTCCTTATTTTTCACAGCTGAAGAAACCAAAGCTTTGGGAAGTTT 

GTGACTTCTCTGAGATCACAGCTGGTGATAGAAGGAGCTGGGACACGCGCTTGGGTTGAC 

TGGCTTCTGGTTTTGGTTCTCTGGCTTCTAGTGCTGGAAGAAGCCCTCTCTTTCCCTTCT 

PTTTCCTCAGTAGCATCTGACTCTTTTCATAAGCAAACAGCTGTATAAACAAAGCCCCCA 

TTTTGGTCAAGCACAGGGTGAATGTGATATTGTTCCCACAACCTTATTCTCCACTCAACA 

GCCGCCTGGCTTTGGGGAAGAGGCCGCCTTCAGGTGACAGTGCAGCTGTCCAGGTGGCCG 

TGCACTGAACCAGGCTGAGGGAGACAAAAACCCCGCAGACCCGCCTGCCTTTCAGCGTCC 

AGTTAACTGCAGAAGTTTAGGCTCACCTCAAAGATGTCTAGTTTTTCCAAGTTACAATAC 

AGCAGTTTCCTACAGAACACCCCCTTCCTCAATTGCCAAGGGGCCGCATCGCACGGCATC 

AGGCCACCACTGCAGGCCAGCAGATTCCACCCCAGGAACGGTCATGAACTCAGCCTTTGT 

CTCAACGAGGGGCGTAACATTTCCTTACAGTCAAGCCCCATCAACTAGAAGTGCTTATTA 

CTTTTAGGATTAAAAAAGTAATAACAGACTTTGACTTAATACTCTGTCTTTTCAGAGGCA 

AAGTGGGTGGGTAGAGGGGAGCTTTAAAAATAGAAGTACAAAACAACATCCTGGAAACAT 

ATGACCCCAGATGGAATAATGTCACATTCCCCAGTGCAGATAATGGGCTGCTGCTGGCTC 

TGTGGTGTCTGTCTGCAGAAGATTTGCTCAGTCAAGGAAATTCAAGTGGTGAGACCTTTC 

CACCATGGGTGGTAAGAGAAACCTGCCTTCACCAAAATCTCTGAAGGGGAAAGAAGTGGA 

GAGAAAGGTTTGCTTCACTTCGGGGACTGCAGTTTGAGAAATAAAAGGGATACAGAGATA 

TCTGCACTTTGTAGAAAGGGCAAGATTATTTGCTTATATCTGAAGGGAGGTGGGTGGTTT 

TGCTGGATGTTTGGTCTGAAAGAGTTACTTTTGATAAAGTTAATCTAATTGTAGTTATAT 

TTTCTGTGTGCTTTTTTTTAATTACTAAGAAAAAAATTGGTGAGTTCAGTAGCTTTGGTA 

Satgagtgcaaatcat^^ 

Iotaatgttagaaaattgcctaaaatgcagtgtaataaataatctctgt^ 
S^aaItggggtaattttctgcaaggaaaatgtactgtttttatgtttccaaccctct 

TGA 



5*1 U) 
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SEQ ID NO: 3 9_AA207220_H 

GCTGTGGCTCCCCGTCCTGGTGCGGGACCTGTGCCCCGCGCTTCAGCCCTCCCCGCAAGC 

CTATTGATTCCCCTGCCGCCCTTGCTCCACCTCCTGCTCGCCATGGAGTCGCTGGTTTTC 

GCGCGGCGCTCCGGCCCCACTCCCTCGGCCGCAGAGCTAGCCCGGCCGCTGGCGGAAGGG 

CTGATCAAGTCGCCCAAGCCCCTAATGAAGAAGCAGGCGGTGAAGCGGCACCACCACAAG 

CACAACCTGCGGCACCGCTACGAGTTCCTGGAGACCCTGGGCAAAGGCACCTACGGGAAG 

GTGAAGAAGGCGCGGGAGAGCTCGGGGCGCCTGGTGGCCATCAAGTCAATCCGGAAGGAC 

AAAATCAAAGATGAGCAAGATCTGATGCACATACGGAGGGAGATTGAGATCATGTCATCA 

CTCAACCACCCTCACATCATTGCCATCCATGAAGTGTTTGAGAACAGCAGCAAGATCGTG 

ATCGTCATGGAGTATGCCAGCCGGGGCGACCTTTATGACTACATCAGCGAGCGGCAGCAG 

CTCAGTGAGCGCGAAGCTAGGCATTTCTTCCGGCAGATCGTCTCTGCCGTGCACTATTGC 

CATCAGAACAGAGTTGTCCACCGAGATCTCAAGCTGGAGAACATCCTCTTGGATGCCAAT 

GGGAATATCAAGATTGCTGACTTCGGCCTCTCCAACCTCTACCATCAAGGCAAGTTCCTG 

CAGACATTCTGTGGGAGCCCCCTCTATGCCTCGCCAGAGATTGTCAATGGGAAGCCCTAC 

ACAGGCCCAGAGGTGGACAGCTGGTCCCTGGGTGTTCTCCTCTACATCCTGGTGCATGGC 

ACCATGCCCTTTGATGGGCATGACCATAAGATCCTAGTGAAACAGATCAGCAACGGGGCC 

TACCGGGAGCCACCTAAACCCTCTGATTGCCTGNNTGGCCTGATCCGGTGGCTGTTGATG 

GTGAACCCCACCCGCCGGGCCACCCTGGAGGATGTGGCCAGTCACTGGTGGGTCAACTGG 

GGCTACGCCACCCGAGTGGGAGAGCAGGAGGCTCCGCATGAGGGTGGGCACCCTGGCAGT 

GACTCTGCCCGCGCCTCCATGGCTGACTGGCTCCGGCGTTCCTCCCGCCCCCTCCTGGAG 

AATGGGGCCAAGGTGTGCAGCTTCTTCAAGCAGCATGCACCTGGTGGGGGAAGCACCACC 

CCTGGCCTGGAGCGCCAGCATTCGCTCAAGAAGTCCCGCAAGGAGAATGACATGGCCCAG 

TCTCTCCACAGTGACACGGCTGATGACACTGCCCATCGCCCTGGCAAGAGCAACCTCAAG 

CTGCCAAAGGGCATTCTCAAGAAGAAGGTGTCAGCCTCTGCAGAAGGGGTACAGGAGGAC 

CCTCCGGAGCTCAGCCCAATCCCTGCGAGCCCAGGGCAGGCTGCCCCCCTGCTCCCCAAG 

AAGGGCATTCTCAAGAAGCCCCGACAGCGCGAGTCTGGCTACTACTCCTCTCCCGAGCCC 

AGTGAATCTGGGGAGCTCTTGGACGCAGGCGACGTGTTTGTGAGTGGGGATCCCAAGGAG 

CAGAAGCCTCCGCAAGCTTCAGGGCTGCTCCTCCATCGCAAAGGCATCCTCAAACTCAAT 

GGCAAGTTCTCCCAGACAGCCTTGGAGCTCGCGGCCCCCACCACCTTCGGCTCCCTGGAT 

GAACTCGCCCCACCTCGCCCCCTGGCCCGGGCCAGCCGACCCTCAGGGGCTGTGAGCGAG 

GACAGCATCCTGTCCTCTGAGTCCTTTGACCAGCTGGACTTGCCTGAACGGCTCCCAGAG 

rrCCCACTGCGGGGCTGTGTGTCTGTGGACAACCTCACGGGGCTTGAGGAGCCCCCCTCA 

GAGGGCCCTGGAAGCTGCCTGAGGCGCTGGCGGCAGGATCCTTTGGGGGACAGCTGCTTT 

TCCCTGACAGACTGCCAGGAGGTGACAGCGACCTACCGACAGGCACTGAGGGTCTGCTCA 

AAGCTCACCTGAGTGGAGTAGGCATTGCCCCAGCCCGGTCAGGCTCTCAGATGCAGCTGG 

TTGCACCCCGAGGGGAGATGCCTTCTCCCCCACCTCCCAGGACCTGCATCCCAGCTCAGA 

AGGCTGAGAGGGTTTGCAGTGGAGCCCTGAGCAGGGCTGGATATGGGAAGTAGGCAAATG 

AAATGCGCCAAGGGTTCAGTGTCTGTCTTCAGCCCTGCTGAACGAAGAGGATACTAAAGA 

GAGGGGAACGGGAATGCCCGCGACAGAGTCCACATTGCCTGTTTCTTGTGTACATGGAGG 

GGCCACAGAGA 

SEO ID NO: 40 AA426580_H, MAK_V_H 

ATGCCGGCGGCGGCGGGGGACGGGCTCCTGGGGGAGCCGGCGGCGCCTGGGGGCGGCGGC 
GGCGCGGAGGACGCGGCCAGGCCCGCGGCGGCCTGCGAGGGAAGTTTCCTGCCTGCCTGG 
GTGAGCGGCGTGCCCCGCGAGCGGCTCCGCGACTTCCAGCACCACAAGCGCGTGGGCAAC 
TACCTCATCGGCAGCAGGAAGCTGGGCGAGGGCTCCTTTGCCAAGGTGCGCGAGGGGCTG 
CACGTGCTGACCGGGGAGAAGGTGGCCATAAAAGTCATTGATAAGAAGAGAGCCAAAAAG 
GACACCTATGTCACCAAAAACCTGCGGCGAGAGGGTCAGATCCAGCAGATGATCCGCCAC 
CCCAATATCACTCAGCTCCTTGATATTTTAGAAACGGAAAACAGCTACTACCTGGTCATG 
GAGCTGTGCCCTGGGGGCAACCTGATGCACAAGATCTATGAGAAGAAGCGGCTGGAGGAG 
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TCCGAAGCCCGCAGATACATCCGACAGCTCATCTCTGCCGTAGAGCACCTGCACCGGGCC 

GGGGTGGTCCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC 

AAGCTGATTGACTTTGGTTTGAGCAACTGCGCAGGGATCCTGGGTTACTCGGATCCGTTC 

AGCACACAGTGTGGCAGCCCTGCCTACGCTGCACCTGAACTGCTCGCCAGGAAGAAATAC 

GGCCCCAAAATCGATGTCTGGTCCATAGGTGTGAACATGTATGCCATGTTGACCGGGACG 

CTGCCTTTCACGGTGGAGCCTTTCAGCCTGAGGGCTTTGTACCAGAAGATGGTAGACAAA 

GAAATGAACCCCCTCCCCACTCAGCTCTCCACAGGTGCCATCAGTTTCCTGCGCTCTCTC 

CTGGAACCGGATCCTGTGAAGAGGCCAAATATTCAGCAGGCACTGGCGAATCGCTGGCTT 

AATGAGAATTACACGGGCAAAGTGCCCTGTAATGTCACCTATCCCAACAGGATTTCTCTG 

GAAGATCTGAGCCCGAGCGTCGTGCTGCACATGACCGAGAAGCTGGGTTACAAGAACAGC 

GACGTGATCAACACTGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTC 

TTAAACAAGAAACTGGAGCGCTATTTGTCAGGGAAATCTGACATCCAGGACAGCCTCTGC 

TACAAGACCCGGCTCTACCAGATAGAAAAGTACAGGGCCCCCAAGGAGTCCTATGAGGCC 

TCTCTGGACACCTGGACACGAGATCTTGAATTCCATGCCGTGCAGGATAAAAAGCCCAAA 

GAACAAGAAAAAAGAGGGGATTTTCTTCATCGACCATTCTCCAAGAAGTTGGACAAGAAC 

CTGCCCTCGCACAAACAGCCCTCAGGCTCGCTTATGACACAGATTCAGAACACCAAAGCC 

CTCCTGAAGGACCGGAAGGCCTCCAAGTCCAGCTTCCCCGACAAAGATTCCTTTGGCTGC 

CGCAATATTTTCCGCAAAACCTCAGATTCCAATTGTGTGGCTTCTTCTTCCATGGAGTTC 

ATCCCCGTGCCACCGCCCAGGACCCCGAGGATTGTGAAGAAACCGGAGCCCCATCAGCCA 

GGGCCCGGAAGCACTGGCATCCCCCACAAGGAAGACCCCCTGATGCTGGACATGGTGCGC 

TCCTTCGAGTCTGTGGATCGCGACGACCACGTAGAAGTGCTGTCTCCCTCTCATCACTAC 

AGGATTCTGAACTCCCCGGTCAGCTTGGCTCGCAGAAATTCCAGCGAGAGGACGCTGTCC 

CCGGGTCTGCCATCCGGAAGCATGTCGCCTCTCCATACTCCTTTGCATCCAACTCTGGTC 

TCTTTTGCTCACGAAGATAAGAACAGCCCCCCAAAAGAGGAGGGCCTGTGTTGCCCACCT 

CCGGTTCCCAGCAATGGCCCCATGCAGCCTCTGGGGAGCCCCAATTGTGTGAAAAGCCGA 

GGCCGGTTCCCTATGATGGGCATCGGACAGATGTTAAGGAAGCGCCATCAGAGTCTGCAG 

CCATCTGCAGATAGGCCCCTGGAGGCCAGCCTGCCCCCACTGCAGCCCCTAGCCCCTGTG 

AACCTTGCCTTTGACATGGCCGATGGGGTCAAGACCCAGTGCTAA 

SEQ ID NO: 4 1_Z3 672 0_H 

ATGGACACAAAGCTGAACATGCTGAACGAGAAGGTGGACCAGCTCCTGCACTTCCAAGAA 
GATGTCACAGAGAAGTTGCAGAGCATGTGCCGAGACATGGGCCACCTGGAGCGGGGCCTG 
CACAGGCTGGAGGCCTCCCGGGCACCGGGCCCGGGCGGGGCTGATGGGGTTCCCCACATT 
GACACCCAGGCTGGGTGGCCCGAGGTCCTGGAGCTGGTGAGGGCCATGCAGCAGGATGCG 
GCCCAGCACGGTGCCAGGCTGGAGGCCCTCTTCAGGATGGTGGCTGCGGTGGACAGGGCC 
ATCGCTTTGGTGGGGGCCACGTTCCAGAAATCAAAGGTGGCGGATTTCCTCATGCAGGGG 
CGTGTGCCCTGGAGGAGAGGCAGCCCAGGTGACAGCCCTGAGGAGTGGGTAAAAGAGGAG 
GAGGTCTGTTTCATGCCTCCAGTTCCCCCAGCTCCGGGGGCAGCAGGACAGAGCCTGCAG 
AAGGATAAGGGGGAGCTGTCTGCCGAGCAGGGGATCTGGGCCACATTGATGACGCTGGTG 
ATCATGGTGACAGCGGCAAATAAAGAGCGAGTGGAAGAAGAGGGAGGAAAACCAAAGCAT 
GTGCTGAGCACCAGTGGGGTGCAGTCTGATGCCAGGGAGCCTGGGGAAGAGAGCCAGAAG 
GCGGACGTGCTGGAGGGGACAGCGGAGAGGCTGCCCCCCATCAGAGCGTCAGGGCTGGGA 
GCTGACCCCGCCCAGGCAGTGGTCTCACCGGGCCAGGGAGATGGTGTTCCTGGCCCAGCC 
CAGGCATTCCCTGGCCACCTGCCCCTGCCCACAAAGGTGGAAGCCAAGGCTCCTGAGACA 
CCCAGCGAGAACCTCAGGACTGGCCTGGAATTGGCTCCAGCACCCGGCAGGGTCAATGTG 
GTCTCCCCGAGCCTGGAGGTTGCACCAGGTGCAGGACAAGGAGCATCGTCCAGCAGGCCT 
GACCCTGAGCCCTTAGAGGAAGGCACGAGGCTGACTCCAGGGCCTGGCCCTCAGTGCCCA 
GGGCCTCCAGGGCTGCCAGCCCAGGCCAGGGCAACCCACAGTGGTGGAGAAACACCTCCA 
AGGGCAGCCCTGCTGAAGGGCGCTGTGGCCCCGGGCTTCTCTCGGAGGGACCTGGTGTTT 
CCTAGCATCTTCTGCGCCTGCCTAGGGATCTCCATCCACATACAAGAGATGGATACTCCT 
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GGGGAGATGCTGATGACAGGCAGGGGCAGCCTTGGACCCACCCTCACCACAGAGGCTCCA 

GCAGCTGCCCAGCCAGGCAAGCAGGGCCCACCTGGGACCGGGCGCTGCCTCCAAGCCCCT 

GGGACTGAGCCCGGAGAACAGACCCCTGAAGGAGCCAGAGAGCTCTCCCCGCTGCAGGAG 

AGCAGCAGCCCCGGGGGAGTGAAGGCAGAGGAGGAGCAAAGGGCTGGGGCCGAGCCTGGC 

ACGAGACCAAGCTTGGCCAGGAGTGACGACAATGACCACGAGGTTGGGGCCCTGGGCCTG 

CAGCAGGGCAAAAGCCCAGGGGCGGGAAACCCTGAGCCTGAGCAGGACTGTGCAGCCAGG 

GCTCCGGTGAGAGCTGAAGCAGTAAGGAGGATGCCCCCAGGCGCCGAGGCTGGCAGCGTG 

GTTCTGGATGACAGTCGGGCCCCACCAGCTCCTTTTGAACACCGGGTAGTGAGCGTCAAG 

GAGACCTCCATCTCTGCGGGTTACGAGGTGTGCCAGCACGAAGTCTTGGGAGGGGGTCGG 

TTTGGCCAGGTCCACAGGTGCACAGAGAAGTCCACAGGCCTCCCACTGGCTGCCAAGATC 

ATCAAAGTGAAGAGCGCCAAGGACCGGGAGGACGTGAAGAACGAGATCAACATCATGAAC 

CAGCTCAGCCACGTGAACCTGATCCAGCTCTATGACGCCTTCGAGAGCAAGCACAGCTGC 

ACCCTTGTCATGGAGTACGTGGACGGGGGTGAGCTCTTCGACCGGATCACAGATGAGAAG 

TACCACCTGACTGAGCTGGATGTGGTCCTGTTCACCAGGCAGATCTGTGAGGGTGTGCAT 

TACCTGCACCAGCACTACATCCTGCACCTGGACCTCAAGCCGGAGAACATATTGTGCGTC 

AATCAGACAGGACATCAAATTAAGATCATTGACTTTGGGCTGGCCAGAAGGTACAAGCCT 

CGAGAGAAGCTGAAGGTGAACTTCGGCACTCCTGAGTTCCTGGCCCCAGAAGTCGTCAAT 

TATGAGTTTGTCTCATTCCCCACAGACATGTGGAGTGTGGGAGTCATCACCTACATGCTA 

CTCAGTGGCTTGTCCCCATTTCTAGGGGAAACAGATGCAGAGACCATGAATTTCATTGTA 

AACTGTAGCTGGGATTTTGATGCTGACACCTTTGAAGGGCTCTCGGAGGAGGCCAAGGAC 

TTTGTTTCCCGGTTGCTGGTCAAAGAGAAGAGCTGCAGAATGAGTGCCACACAGTGCCTG 

AAACACGAGTGGCTGAATAATTTGCCTGCCAAAGCTTCAAGATCCAAAACTCGTCTCAAA 

TCCCAACTACTGCTGCAGAAATACATAGCTCAAAGAAAATGGAAGAAACATTTCTATGTG 

GTGACTGCTGCCAACAGGTTAAGGAAATTTCCAACTTCTCCCTAA 

SEO ID NO: 42 SGK088 H 

GGGGAGATGGCGCTGTTTGAGTGCCTGGTGGCGGGGCCCACTGACGTGGAGGTGGATTGG 

CTGTGCCGTGGCCGCCTGCTGCAGCCTGCACTGCTCAAATGCAAGATGCATTTCGATGGC 

CGCAAATGCAAGCTGCTACTTACATCTGTACATGAGGACGACAGTGGCGTCTACACCTGC 

AAGCTCAGCACGGCCAAAGATGAGCTGACCTGCAGTGCCCGGCTGACCGTGCGGCCCTCG 

TTCGCACCCCTGTTCACACGGCTGCTGGAAGATGTGGAGGTGTTGGAGGGCCGAGCTGCC 

CGTTTCGACTGCAAGATCAGTGGCACCCCGCCCCCTGTTGTTACCTGGACTCATTTTGGC 

TGCCCCATGGAGGAGAGTGAGAACTTGCGGCTGCGGCAGGACGGGGGTCTGCACTCACTG 

CACATTGCCCATGTGGGCAGCGAGGACGAGGGGCTCTATGCGGTCAGTGCTGTTAACACC 

CATGGCCAGGCCCACTGCTCAGCCCAGCTGTATGTAGAAGAGCCCCGGACAGCCGCCTCA 

GGCCCCAGCTCGAAGCTGGAGAAGATGCCATCCATTCCCGAGGAGCCAGAGCAGGGTGAG 

CTGGAGCGGCTGTCCATTCCCGACTTCCTGCGGCCACTGCAGGACCTGGAGGTGGGACTG 

GCCAAGGAGGCCATGCTAGAGTGCCAGGTGACCGGCCTGCCCTACCCCACCATCAGCTGG 

TTCCACAATGGCCACCGCATCCAGAGCAGCGACGACCGGCGCATGACACAGTACAGGGAT 

GTCCATCGCTTGGTGTTCCCTGCCGTGGGGCCTCAGCACGCCGGTGTCTACAAGAGCGTC 

ATTGCCAACAAGCTGGGCAAAGCTGCCTGCTATGCCCACCTGTATGTCACAGATGTGGTC 

CCAGGCCCTCCAGATGGCGCCCCGCAGGTGGTGGCTGTGACGGGGAGGATGGTCACACTC 

ACATGGAACCCCCCCAGGAGTCTGGACATGGCCATCGACCCGGACTCCCTGACGTACACA 

GTGCAGCACCAGGTGCTGGGCTCGGACCAGTGGACGGCACTGGTCACAGGCCTGCGGGAG 

CCAGGGTGGGCAGCCACAGGGCTGCGTAAGGGGGTCCAGCACATCTTCCGGGTCCTCAGC 

ACCACTGTCAAGAGCAGCAGCAAGCCCTCACCCCCTTCTGAGCCTGTGCAGCTGCTGGAG 

CACGGCCCAACCCTGGAGGAGGCCCCTGCCATGCTGGACAAACCAGACATCGTGTATGTG 

GTGGAGGGACAGCCTGCCAGCGTCACCGTCACATTCAACCATGTGGAGGCCCAGGTCGTC 

TGGAGGAGCTGCCGAGGGGCCCTCCTAGAGGCACGGGCCGGTGTGTACGAGCTGAGCCAG 

CCAGATGATGACCAGTACTGTCTTCGGATCTGCCGGGTGAGCCGCCGGGACATGGGGGCC 



PCTAJSOO/14842 

WO 00/73469 

FIGURE 2GG 

GCAGAGGCCCCTCGGTTTGAGTCCAT^G^GACGT TCATGTGGTACA AGGAC 
GCTCGCTTTGCGGTGGTGGTCGAGGGAA^C^ 

GAGGTGCTGCTGACCGAGAGCAGCCATGTGAGUliu GCACCGCC CAGAAC 
CTGGTGGTGCTCAGCACGGGGGCCCAGGATGGAGGCGTCT 

CTGGCGGGTGAGGTCTCCTGCAAAGCAGAGTT^ 

Lggtcgagggggtcggggaggatgagga^ 

GACATCCACCAGGAGATCGGCAGGGGTGCTTT^ 
CCTGAGAAGCTGCTCGTGTGGGATGGTGC^ 

CTGAGCCTGAGCAGGGAGGCCCGGGGCTTCCT^i^ ^^ ctcaggc ^ gggcgca 



agaggtaccg^ 



gtga gcacggatcacctgaagct^cgtct™ 



ACCCCAGAGACTGGGGCTGC^CCCCCATGGACTG ^.^csrgcccGCAGCT 

CAGGACCAGGAGGCTCCCAGCCCAGAGGCCCTCCCC1L TGASAGCGCC CTGCCC 

CGGGCCGGGCCGCGGGAGCTGGGCCGGGGCCTGCAC^ 

CAGCGCCGGAGCCCCGGCCCGGGAGCCACCCGCL1 GGGAGGCCC CGAGGAT 

SgtatgcccagaggctgcaggOT 

— aaggtcagcggcctcaggggtcccctgctggag CAGCCCCCAC tcgag 



ccc^gat^cacgagctgcctccagcgaggcagcgccc 

AACCGGGGCCTGCAAAAGAGCAGCAGCTTCTCC^ GCTTGGGGC CCGTAGGCTA 
CACCGCCGAGCGGGGGCGCCCCTCGAGATCCCCGT 

U3TCGGAGGCCGTGTTCGAGGCC 



GTGCTAGCCGAGAAAGCCCGAGTTCC^CGGTGCCCC^ 
AGCAGCATCGAAAACTOGAGTC^^^ 



PCTAJS00/14842 

WO 00/73469 

FIGURE 2HH 

S^TCTCTGCCAGCGTCCAGGAGGAGTTGGGTCACCAGTACCT 
TCACTGGCCCCACCCCTAGCTCCTGCTGCCCCCA^ 

ScCTGATGAAGCTGCGCCGCCAGACGCT^^ 
SccScCAATGCCACGGGACATTCCAGGGGC^^ 

II s ) 



WO 00/73469 
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FIGURE 211 

GAGAGCTGGAGGCCAGCAGTCACTCACACTCGCTCTGTCCTCCTGTCCAGTGGATACAGC 
CCTCGGCGCTCTGCTGGCCCAAGGATGTCCCCACTGCCCCTCCATGGCCTTTGGCCTTCT 
TCCCATTCATATTTATTTATTTATTGACTTTTATGAAGTTTCCCCTTCCATCCGATCCCT 
ACTGCCCATGTTGTCCTGACCATCCCTCCCAGCCATCCAGCTGTCTGTCTGTCTGCCACA 

AGGAAATAAAAATGGCAAGCAGCAAAAAAA 
9FO ID NO- 43 AA542015 M SGK088_M 

GCCACGGACATCTGGGGAGCGG5TGTGCTCACCTACATCATGCTTAGTGGGTACTCCCCA 
TTCTATGAGCCAGACCCCCAGGAAACAGAGGCTCGGATTGTTGGGGGTCGCTTTGATGCC 
TTCCAGTTGTATCCTAACACATCCCAGAGTGCCACCCTCTTCTTGAGAAAGGTCCTCTCA 
GTACATCCCTGGAGCCGGCCCTCTCTGCAGGACTGCTTGGCCCACCCATGGCTGCAAGAT 
GCCTACCTGATGAAGCTGCGCCGCCAGACACTCACCTTCACCACCAACCGGCTCAAGGAA 

??StLgcgagcagcggcgacgtcgggctgaggctgctacccgtcacaaggtgctc 

rGCTCCTACCCTGGCAGCCCCTAGGTGGCACAGACCGCAGCCCGGCCACGGGCTTCAACT 
TGGGTTCTCACTCGCGCTGCCAAGGGACATTCCAGAGCCCATGCTGAGCTGGACAGGCAG 

gggStcagataccagcagcagcagcagcagcagcagcagcaacatctggctgggc^ 

acctStggacctaagaggacaaggccctggggcttcagccgaatgtcaccccgg^ 

accagagcaggagacccactggccaggctgggcaagggtgagagcagaaagaggcaaaga 

gggaStgggaagtgaagaatgagacggaggatagagagggaggagtttgaggaag^ 

taggctggagtggaatgctatatctcagggagaagccagaaggggacatggctgaagag^ 

SgaagLccctgtgatgtgggaatgtggtggagaggaggactggacatagagagtgtgc 

^g^cagagcagagacataagggagggcagaagggtagaaggcaacaggagt^ 

Sggggtggcagggcaggccagcagctgcatcttcagaaagagagaggagaaaggcaaag 

agacgaaaggccgctccagctggtctcctgtcccagccgatgcagttctgggcgttctcc 

actggctcagggatgtcctcactgctcctccatggcctttgccctccttcccatttgtat 

tcccgaccatcccccttcagtcacccagctgtctgtgcagctgtctgtctgtctgtcaca 
aggaaaataaaacaaaacaaaacaaaacaaaacaaacaaaaaaaaaacagc 

cpn td NO- 44 R19772 H 

a?gaagggcggcgIcagggc5tacacccgaggtccctctttggggtggctctttgctaag 
tc??gc?gttgcttcccgtgtagagatgcatactctcattcctcaagcgagaatggaggc 
Sg^SaSccgtagccaacctgcaggcccagccctccctgaacttca 
Sgggt?ccaagcgctccaccaacactcttaagaagtggctgacgagtcctgtgcgt^ 

cSScagcgggaaggcagatggaaacatcaaaaagcagaagaaag 

Sga^ct?tgacctgggatctcccaagcctggggatgaaacaacccctcagggaga^ 

gctgatgagagcaagaaaggttggggtgaagatgagccggatgaagagtcacacacaccc 

Sc2cI?cacctatgaagatttttg^^^ 
SgcSgcagcccggcaggcttccactgaagtacctactgct^ 

ISgaaa^?^ 

aa^ga^tgcaggctgcctgaatgaggggatggccccacccacacctcctaaaaatcca 
gaag^gaacagaaagccaaggccctgagaggcaggatgtttgtcctgaatgagctggta 

r^ActGAGAAAGACTATGTCAAGGATCTGGGCATTGTGGTGGAGGGCTTCATGAAGAGA 

aSca^ca^ttatgactggcataaggattttttcctggcggaactggaaa^^ 
cIggagcaagacagattggcacagctctttattaagcacgagcggaagctgcac 
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FIGURE 2JJ 

AAACGCTGCAATGACATGATGAATCTAGGACGTCTGCAGGGCTTTGAGGGCACTCTGACT 

GCTCAGGGGAAGCTACTGCAGCAGGACACATTCTATGTGATCGAGCTGGATGCAGGCATG 

CAGTCCCGGACCAAAGAGAGGCGCGTGTTCCTCTTCGAGCAGATTGTCATCTTCAGTGAA 

CTGCTCAGGAAGGGATCCCTCACCCCTGGCTACATGTTCAAAAGGAGCATCAAGATGAAT 

TACTTGGTCCTGGAGGAGAATGTGGACAATGATCCCTGCAAGTTTGCACTCATGAACAGA 

GAGACTTCTGAGAGGGTTGTTCTGCAAGCCGCCAACGCTGACATCCAGCAGGCCTGGGTG 

CAGGACATCAATCAAGTCTTAGAAACACAGCGAGACTTTTTGAATGCACTGCAATCGCCC 

ATTGAGTATCAACGGAAAGAAAGGAGCACAGCTGTGATGAGGTCTCAACCTGCCAGGCTT 

CCCCAAGCCAGCCCCAGGCCCTACTCCTCTGTTCCTGCGGGCTCAGAGAAGCCCCCAAAG 

GGCTCCAGCTATAACCCACCTCTGCCTCCCCTGAAGATATCTACCTCCAATGGCAGTCCA 

GGGTTTGAATACCACCAGCCTGGGGACAAGTTCGAAGCCAGCAAGAACGACCTGGGAGGC 

TGCAATGGGACCTCGTCCATGGCCGTGATCAAAGATTACTATGCACTGAAGGAGAATGAA 

ATCTGTGTGAGCCAAGGTGAGGTGGTCCAGGTCCTCGCCGTCAACCAGCAGAACATGTGT 

CTGGTGTACCAGCCTGCCAGCGACCATTCCCCCGCCGCCGAGGGCTGGGTCCCAGGCAGC 

ATCCTGGCGCCCCTCACCAAAGCCACAGCAGCAGAAAGTAGTGACGGGAGCATCAAGAAG 

TCATGTTCATGGCATACTCTACGCATGAGAAAGCGGGCGGAAGTGGAGAACACGGGTAAA 

AATGAAGCCACAGGGCCTCGTAAACCCAAGGATATTCTGGGCAACAAAGTCTCTGTTAAA 

GAGACGAACAGTTCCGAGGAATCAGAGTGTGATGATCTTGACCCTAATACTAGCATGGAG 

ATCTTAAATCCAAATTTCATCCAAGAAGTGGCCCCAGAATTCCTTGTGCCCTTGGTGGAT 

GTGACCTGCTTGCTTGGGGACACAGTGATACTGCAGTGCAAAGTCTGTGGGCGGCCAAAG 

CCCACCATCACTTGGAAGGGTCCAGACCAGAACATCCTTGACACTGATAACAGCTCAGCC 

ACATACACGGTCTCCTCTTGTGATTCTGGAGAAATCACCCTGAAGATCTGTAATCTGATG 

CCCCAAGACAGTGGGATTTATACCTGCATAGCAACAAATGACCACGGGACCACATCAACG 

TCTGCAACAGTCAAAGTGCAAGGTGTTCCAGCAGCCCCTAACCGCCCCATTGCCCAGGAG 

AGAAGCTGCACCTCCGTGATTCTCCGCTGGCTGCCCCCCTCCAGCACAGGAAACTGCACT 

ATTTCTGGTTACACTGTGGAGTACAGAGAGGAAGGTTCTCAGATCTGGCAGCAGTCAGTG 

GCTTCGACCTTGGACACTTACCTCGTCATCGAAGACCTTAGTCCCGGGTGTCCTTATCAG 

TTCAGAGTCAGTGCCAGTAACCCCTGGGGAATCAGCCTTCCCAGCGAGCCCTCGGAGTTT 

GTGCGACTTCCAGAATACGATGCTGCTGCTGATGGTGCCACCATTTCTTGGAAGGAAAAT 

TTTGACTCAGCTTACACTGAGCTGAATGAAATTGGAAGAGGCCGTTTCTCTATAGTAAAG 

AAATGCATTCACAAAGCTACCCGCAAAGATGTGGCTGTGAAATTTGTTAACAAAAAAATG 

AAGAAGAAAGAACAGGCTGCCCACGAGGCTGCCCTGCTTCAGCACCTACAGCACCCCCAG 

TACATCACTCTCCATGACACCTATGAGTCCCCCACATCCTACATCCTGATCTTGGAACTG 

ATGGATGATGGCCGGCTCTTAGACTACCTTATGAATCATGATGAACTGATGGAGGAAAAA 

GTAGCTTTCTATATCCGAGACATCATGGAGGCTCTGCAGTACCTTCACAACTGCAGGGTT 

GCACATTTGGACATAAAGCCTGAAAACCTGCTCATTGACCTACGGATTCCAGTGCCTCGA 

GTGAAGCTCATTGACTTGGAGGATGCTGTCCAGATCTCGGGTCACTTCCACATTCACCAC 

CTGCTGGGGAACCCTGAGTTTGCTGCCCCAGAAGTCATTCAAGGCATCCCCGTCTCCCTG 

GGGACAGACATCTGGAGCATCGGGGTTCTGACATATGTCATGCTGAGTGGGGTCTCCCCC 

TTCTTGGATGAGAGCAAAGAGGAGACATGTATCAACGTATGCAGGGTGGATTTCAGCTTC 

CCCCATGAATACTTCTGTGGTGTGAGCAATGCTGCCAGAGATTTCATCAATGTGATCTTA 

CAGGAAGATTTTCGGAGGCGGCCCACAGCAGCCACATGCTTGCAGCATCCATGGCTGCAG 

CCCCATAATGGCAGCTACTCTAAGATCCCCCTGGACACCTCCCGCCTAGCATGCTTCATA 

GAACGTCGCAAGCACCAGAATGATGTGCGGCCTATCCCCAATGTCAAGAGCTACATTGTC 

AACCGGGTGAACCAAGGGACGTAG 
SEO ID NO: 4 5 5R72 8 2 H 

CGCCGCTGTTTGTCCTCGCGCGGCCCCGTCCACTGCCCTGCGGTTGCTCTGCGGGCTGAA 
AAGTTTCTCCCGGTGCAGAATTCCGGGCTCAGCGACAGCCTGCGCCGAGTGTGCGCACCT 
GTCGGAGACCCGCCAGTCCGCCGGCCCCGGCTTTGTTCGTGCGGAACTGTAGTGGTGAGA 
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FIGURE 2KK 

AAAACTCCATGTCTGGGCACGCCTGGCTGATCTTCACCTCTTTCTTCTAGGACCTTCCTC 
TGGGCTGTCACGTGTGAATATGTGTCTAGTGCATCCTTAACCTGAGGACTTCACCAGTTC 
GAAATTACAGTTTTCACCATCAACTACCTTATCCTTTTTGGCCTGGTTTTCTTCCTCAAA 
CAGTGGAAACATTTTTAAAGTTGCTTTTGTTGCAGAGTTAAACAAATGGCTGATAGTGGC 
TTAGATAAAAAATCCACAAAATGCCCCGACTGTTCATCTGCTTCTCAGAAAGATGTACTT 
TGTGTATGTTCCAGCAAAACAAGGGTTCCTCCAGTTTTGGTGGTGGAAATGTCACAGACA 
TCAAGCATTGGTAGTGCAGAATCTTTAATTTCACTGGAGAGAAAAAAAGAAAAAAATATC 
AACAGAGATATAACCTCCAGGAAAGATTTGCCCTCAAGAACCTCAAATGTAGAGAGAAAA 
GCATCTCAGCAACAATGGGGTCGGGGCAACTTTACAGAAGGAAAAGTTCCTCACATAAGG 
ATTGAGAATGGAGCTGCTATTGAGGAAATCTATACCTTTGGAAGAATATTGGGAAAAGGG 
AGCTTTGGAATAGTCATTGAAGCGACAGACAAGGAAACAGAAACGAAGTGGGCAATTAAA 
AAAGTGAACAAAGAAAAGGCTGGAAGCTCTGCTGTGAAGTTACTTGAACGAGAGGTGAAC 
ATTCTGAAAAGTGTAAAACATGAACACATCATACATCTGGAACAAGTATTTGAAACGCCA 
AAGAAAATGTACCTTGTGATGGAGCTTTGTGAGGATGGAGAACTCAAAGAAATTCTGGAT 
AGGAAAGGGCATTTCTCAGAGAATGAGACAAGGTGGATCATTCAAAGTCTCGCATCAGCT 
ATAGCATATCTTCACAATAATGATATTGTACATAGAGATCTGAAACTGGAAAATATAATG 
GTTAAAAGCAGTCTTATTGATGATAACAATGAAATAAACTTAAACATAAAGGTGACTGAT 
TTTGGCTTAGCGGTGAAGAAGCAAAGTAGGAGTGAAGCCATGCTGCAGGCCACATGTGGG 
ACTCCTATCTATATGGCCCCTGAAGTTATCAGTGCCCACGACTATAGCCAGCAGTGTGAC 
ATTTGGAGCATAGGAGTCGTAATGTACATGTTATTACGTGGAGAACCACCCTTTTTGGCA 
AGCTCAGAAGCGAAGCTTTTTGAGTTAATAAGAAAAGGAGAACTACATTTTGAAAATGCA 
GTCTGGAATTCCATAAGTGACTGTGCTAAAAGTGTTTTGAAACAACTTATGAAAGTAGAT 
CCTGCTCACAGAATCACAGCTAAGGAACTACTAGATAACCAGTGGTTAACAGGCAATAAA 
CTTTCTTCGGTGAGACCAACCAATGTATTAGAGATGATGAAGGAATGGAAAAATAACCCA 
GAAAGTGTTGAGGAAAACACAACAGAAGAGAAGAATAAGCCGTCCACTGAAGAAAAGTTG 
AAAAGTTACCAACCCTGGGGAAATGTCCCTGAGACCAATTACACTTCAGATGAAGAGGAG 
GAAAAACAGTCTACTGCTTATGAAAAGCAATTTCCTGCAACCAGTAAGGACAACTTTGAT 
ATGTGCAGTTCAAGTTTCACATCTAGCAAACTCCTTCCAGCTGAAATCAAGGGAGAAATG 
GAGAAAACCCCTGTGACTCCAAGCCAAGGAACAGCAACCAAGTACCCTGCTAAATCCGGC 
GCCCTGTCCAGAACCAAAAAGAAACTCTAAGGTTCCCTCCAGTGTTGGACAGTACAAAAA 
CAAAGCTGCTCTTGTTAGCACTTTGATGAGGGGGTAGGAGGGGAAGAAGACAGCCCTATG 
CTGAGCTTGTAGCCTTTTAGCTCCACAGAGCCCCGCCATGTGTTTGCACCAGCTTAAAAT 
TGAAGCTGCTTATCTCCAAAGCAGCATAAGCTGCACATGGCATTAAAGGACAGCCACCAG 
TAGGCTTGGCAGTGGGCTGCAGTGGAAATCAACTCAAGATGTACACGAAGGTTTTTTAGG 
GGGGCAGATACCTTCAATTTAAGGCTGTGGGCACACTTGCTCATTTTTACTTCAAATTCT 
TATGTTTAGGCACAGCTATTTATAGGGGAAAACAAGAGGCCAAATATAGTAATGGAGGTG 
CCAAATAATTATGTGCACTTTGCACTAGAAGACTTTGTTAGAAAATTACTAATAAACTTG 
CCATACGTATTACAGCAGAAGTGCTTCAGTCATTCACATGTGTTCGTGAGATTTTAGGTT 
GCTATAGATTGTTTAAGACAGCTTATTTTAAATGTAGAAAAATAGGAGATTTTGTAACTG 
CTTGCCATTAACTTGCTGCTAAATTCCCAATGTATTGATTAAATCAATAAAAAACAGATG 

TTACTC 

SEQ ID NO: 46_SGK309_H 

GGGTCCGCAGCCCGCCCTCACAGGCCCTCCTCACTCCCCTAGGTAGATGGCCCCCTCAGG 
GCAGGCCCGGCGGACACCCCTCCCTCTGGCTGGCGGATGCAGTGCCTAGCGGCCGCCCTT 
AAGGACGAAACCAACATGAGTGGGGGAGGGGAGCAGGCCGACATCCTGCCGGCCAACTAC 
GTGGTCAAGGATCGCTGGAAGGTGCTGAAAAAGATCGGGGGCGGGGGCTTTGGTGAGATC 
TACGAGGCCATGGACCTGCTGACCAGGGAGAATGTGGCCCTCAAGGTGGAGTCAGCCCAG 
CAGCCCAAGCAGGTCCTCAAGATGGAGGTGGCCGTGCTCAAGAAGTTGCAAGGTTCGGGC 
CTCGGGCAGGGGGATGGGAAGGAAGAGATGATGAAGCCAGGGGCTAAGAGAGGGAAGGAC 
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FIGURE 2LL 

rATGTGTGCAGGTTCATTGGCTGTGGCAGGAACGAGAAGTTTAACTATGTAGTGATGCAG 
ArrACCACATTGCGGCTGGGCAAGCAGATCTTGGAGTCCATCGAGGCCATCCACTCTGTG 

ggcttotgSccgtgacatcaagccttcaa^ 
?aSggSgScStLgctggacttcgggctggcccggcagtac^^ 

gatg?g^g?cc?ctcggaatgtggccggg 
^gScacSgaaccgggagatgggccgccacgac^ 
c^ggSSg^agtgggccagctgccctggaggaaga^ 
gggSgSSIggIgaagtatgagcaccggatg 

cacctcttcc^ggac^cattgccagcctcgactacttcaccaagcccgactaccact 

SSggSg^ggcaggcaccgatgccctcctgtccacgagcacctctacc 
gcagSScc^ggcagacggcagccatgtttggggtggtcaatgtg^ 

rrACCTGCTCCGGGAGAACACCGCGGATGTGCTACAGGGAGAGCACCTGAGTGACCAGGA 
TG^CCCcScCC^GGGGTCCTGAGGCTGAAGTCTGGGAGGAGACAGATGTCAACCGGAA 

caa^ScSgatcaIcatcggcaaagtaactgccgccagggcgaaggg 

tcSg?^^ 

ggcgcgcca4atatca*cacgt^ 
SgcagSgctggcac^ 

Sr^rGCCGCCGCCGCCG^ 

??cggccSggSgccgcgc5cg^ 
gSgcgSccc™^^ 

rrrrTCCTCCAGAAACCCCTGCGTCCGCCACGGCCCAGGTTAAATGGAAACCACCCTTGG 

gaaSgSScI^ 

r^CAGCTGGATATCCTGAGTGTTGGAATCCTAGTGAAAGAAAGATGGAAAGTGTTGAGA 

?a?attgggggtcggg^ 

ttggagtctattgaaagcattcattctgtgggatcttgncatcgagacatcaaaccgtcg 

^ggc^gIcS?ttaccaattcctgtggtgacg 
™Sggg^^ 

rTCATGTTGAAACATCTCCCTCCAGAATTCAGCATCTTTCTAGACCATATCTCTTCTTTG 
CATTATTTT^CAAAACCAGACTACC^ 

^^^^rPAATGCTACTCCCATCCCTGGAGACTTGCTTCGAGAAAATACAGATGAG 
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GTCGGGTGAGAGATTAGTCAGCGAGAGA^ 

TTTTACAGTAACTTTGTTTAGRAAGTMTCTCTTCtA GG A GGGG AAAAAGT 

SEQ ID NO: 48 - AA4 ^^^-" rTGTTTTTG ATTTAGAGGCGAAATCAGCACGTGGT 
AC TTTTACTATATTCTTTGAGATGA^ 
GGCTCAAATCTCCTTATGGATAGTGTTTCTO^ 
CGGGGCCTGGCGTACATCCACCACCAAC^CGTTCTTC^ 

TTACTCATCAGTCACCTGGGAGAGCTCAAACT 

GGCTTTGCCAGAGACCGCGTCTCCGCCCAGGMGCAG^CRTG^ c^oasTG' 

Sgccatctcagctgtaccagcttcctgatgaggagtctttc^ 
Lgctaaagccagaaatgtgtgaccttttggcctcctac« 

CAGTTTAGCAAATGCTGGTGAAAAGAAAGGGCGAGATCACCAA ^tctmcTCC 
^TCTGCAGTTTCGGTTTTCATTTGCTTOGCTTACTAAGAAGC 



ATTTCTGCAGTTTCGGTTTTCA1 1 ^^Xr^.rrTGGAATAGTTTAAATATGGTGTT 
^ACTGAACAAGGGGCTTTATGTCCTCA^ATGA^ 

TTTCTGTCATATACATAACTGTATTCACAGAGATATAAMCCT^ TCCAGGAGA 
^CAAGGAATAATGAA^TGTSA^^ 

AACCTGTTCCCAACTCCTGGAGAGCTCCTA^ 

AAGAAAAGCACGTAATGAAGGAAGAA^CAGAA^ 

CATACCAGGAAGCCACATCTCCCCCACACCTGATGGAA 

Mil) 
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TGAAAATTAAGATGCCTTCTA 
TGCTTTTACAT 



FIGURE 2NN 

GAATTGGTTTGCTCTGATCATTGCTGATTCCTTTCCCCA 



SEQ ID NO: 50_AA061797 M ^ttgaaacACCCAAACCTCGTGAACCT 

TATCTTTAGGAGTAACCAGTTTTTCCGCGGCATCAGC^ 
GACTCTTGAAGAAAAATTCTCAAATGTTCAGC^ 

AGTATCTTGGCATTTCATTTCTGACCTCATCACAGA CCTTGTGfiA AGGGCG 
TATCACTTTGCATCTTAGAATTCTACCTGTTTTAGC1 CCGTCTGG TTTTGTCT 

^GTGTTTGTATCAAGGGGCAGGAAA^ 

ISgtgagttgaggctcttcagcaaaa^^ 

oScCGGCCATTTCTCTAAATATCATT^ 

SaSaatgtatgtagaagctggg^ 

TTAGCACTTAATAACTAACTAGCATTATGGTAGTCTAAACT 
TTC^TTAA^TTA^TTTAATAA^AGCCAGAAACATT 



fg//'5 
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FIGURE 200 

SEQ ID NO: 51.AA397553 H ACGGGAGTGGAG GAGCTTCTGGAACT 

GACACCTTCTCCGATGACATGGCCTTCAAACTAfcA caGCACAGGC GTTCCCGG 
GGATCAGATCGGAGCGACCG^ 

GACTTACTAAAAGCTAAACAGACCGAAAAAGAAA^ GAATGA GGAGACTGATGAC 
GGATCGATGAAGGACCGGATATCGGGAAGTT^ 
TATGGGAAGGCGCAGGTAGCCAAA^GCAGCAGCAA^ 
AAGGAGAAGACCAGGAAAGAACGGGAGCTG^ 
CATCGAAAAAGGGAAACACCCAAAAGTTAC^^ 

aggagcccccacaggaagtggtctgaca^ 

TCTTATGGCCAAGATTATGACCTTAGTCCCTCACGAi GTCAGTC CCCCTTACAAG 
TCCTACAAGAAAAGTCCTGGAAGTACCTC^GAGAA 



CATCGAAAAAGGGAAACACCCAAAAG11A GATGATAGC CCCTCGGGAGCT 

aggagcccccacaggaagtggtctgaca^ 

TCTTATGGCCAAGATTATGACCTTAGTCCCTCACGAi GTCAGTC CCCCTTACAAG 

tSacaagaaaagtcctggaagta^ 
gagccttcggcctaccagtccagc^ 

tctgtcagtccctatagcaggagacggtcgtccao CCCTTTCC tgagcaagcgg 
gggcgatcgcccagtccctatggtcgaaggcggtccag^^ 

tctctgagtcggagtccactccccagtagg^^ 
gcIS^caagacattcatcttctcatagtaa^ 
SSccagtatctcacctgtcaggcttc^ 

GACAATGAGAAAGAGGGCTTCCCAAT^ 

MCTGTGGATGTATTCTTGGGGAACTATTCACAMGAA CCCTTGTCCftGC TGTGTGG 
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CGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATG 
CTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTT 
AAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCAT 
GAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCA 
CCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAG 
AACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGAT 
GC71ATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTA 
AACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATC 
CACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCTG 
ACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAG 
GAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGATGACCCTTGAAGCTTCA 
AGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACC 
CAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGG 
CCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACATTCTT 
CCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCT 
CTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCC 
TTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCAC 
CAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAAC 
ACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCA 
GCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTG 
AGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGAC 
CAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTC 
CTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAACTAT 
GGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGG 
GGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCA 
AGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAA 

SEQ ID NO: 52_AA7892 3 9_H 

TGAAAATGGAGATGTATGAAACCCTTGGAAAAGTGGGAGAGGGAAGTTACGGAACAGTCA 
TGAAATGTAAACATAAGAATACTGGGCAGATAGTGGCCATTAAGATATTTTATGAGAGAC 
CAGAACAATCTGTCAACAAAATTGCGATGAGAGAAATA7^AGTTTCTAAAGCAATTTCATC 
ACGAAAACCTGGTCAATCTGATTGAAGTTTTTAGACAGAAAAAGAAAATTCATTTGGTAT 
TTGAATTTATTGACCACACAGTATTAGATGAGTTACAACATTATTGTCATGGACTAGAGA 
GTAAGCGACTTAGAAAATACCTCTTCCAGATCCTTCGAGCAATTGACTATCTTCACAGTA 
ATAATGTAATCATTCATCGAGATATAAAACCTGAGAATATTTTAGTATCCCAGTCAGGAA 
TTACTAAGCTCTGTGATTTTGGTTTTGCACGAACACTAGCAGCTCCTGGGGACATTTATA 
CGGACTATGTGGCCACACGCTGGTATAGAGCTCCCGAATTAGTATTAAAAGATACTTCTT 
ATGGAAAGTATGTGCCTGTGGATATCTGGGCTTTGGGCTGTATGATCATTGAGATGGCCA 
CTGGAAATCCCTATCTTCCTAGTAGTTCTGATTTGGATTTACTCCATAAAATTGTTTTGA 
AAGTGNGATTCATGCCAGAACTGAAAGCTAAATTACTGCAGGAAGCAAAAGTCT^ATTCAT 
TAATAAAGCCAAAAGAGAGTTCTAAAGAAAATGAACTCAGGAAAGATGAAAGAAAAACAG 
TTTATACCAATACACTGCTAAGTAGTTCAGTTTTGGGAAAGGAAATAGAAAAAGAGAAAA 
AGCCCAAGGAGATCAAAGTCAGAGTTATTAAAGTCAAAGGAGGAAGAGGAGATATCTCAG 
AACCAAAAAAGAT^AGAGTATGAAGGTGGACTTGGTCAACAGGATGCAAATGAAAATGTTC 
ATCCTATGTCTCCAGATACAAAACTTGTAACCATTGAACCACCAAACCCTATCAATCCCA 
GCACTAACTGTAATGGCTTGAAAGAAAATCCACATTGCGGAGGTTCTGTGACAATGCCAC 
CCATCAATCTAACTAACAGTAATTTGATGGCTGCAAATCTCAGTTCAAATCTCTTTCACC 
CCAGTGTGAGGTTAACTGAAAGAGCAAAAAAGAGACGCACTTCTTCACAATCTATTGGAC 
AAGTTATGCCTAATAGCAGGCAAGAGGATCCAGGTCCTATTCAAAGCGAAATGGAGAAGG 
GTATATTTAATGAGCGAACAGGTCACAGTGACCAAATGGCAAATGAGAACAAAAGGAAGC 
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CTTGAA 

TTTAAAGAA^TGAACCTGTGAGA^^ 

GGTGTCAAAGAAGACCCACACGCTGGGGGTTGTATGA^^ 

AGCAGTAATTTGTTGGCCGCAAATCTCAGTTCAAA^ 

ACTGAAAGAACAAAAAAGAGACGCACTTCTTCACAAACT 

AGCAGACAAGAGGACACAGGTCCCACACAAGTC 

CGAACAGGTCAGAATGACCAAATATCGAGTG^ 

TGCGACAGGAAAGAATTCCATTTCCCTGAACTGC^ATTC^A 

AAAGGGATGGAAGTTAAACAGATAAAAGTGCTG^AGA 

TCTAAAATACCAACTTTACTTAGTATGGACCCAAAT^^ 

GGCGATTGTGAGGGGAAGAATTTGAAGAGGAAC^ 

GTCTTTTAAGTAATCTTAAAAATACAAGCTTGACAATTCCli c ATCAT 

aSLaatgtacataggttgctg^ 

TGTTCCTAGGTTAAACTCCTTGAGATGAAACTATTTCCT 
GTTAACAT 

SEQ ID NO: 54 -^^" ^ G ^GACTTTGGCCTGGCCCGGGTCTTCTCTCCGGAT 

AGCGCCTCAGGCCAGCTCAAGATAGCTGACTTTG^^ AGCT CCTGAACTCCTG 

GGTGGTCGCCTCTACACACATCAGGTGGCCACC^ 

TATGGCGCTCGGCAGTATGACCAGGGCGTTGA^ 

GAGCTGTTGAATGGGTCCCCCCTGTTCCCGGGC^ 

GTGCTTCGCATCCTGGGTACCCCGAGTCCT^ 
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TTGAATGAGCGAGATTATCGGGACCG^ 

CATGGTCATGTTTGTATTGTGTTTGAACTACT^ 
GMCATCACAGTACTTTGGTGTCTACC^^ 

CATCCTTTOTTGACTTATTAAAAAAGAAATGAAATOGS^^ 

CTTCTCTAGAA<aGATTACTTAA<aCT^^T»AC^A« RGraT 

^cattamttattttgtagagttaagtctamtattg™ 



WO 00/73469 



PCT/US00/14842 



FIGURE 2SS 



ACTTAACTTTAAAAGTTTTTCTTCTGTAATTGTTGAACGGGTGATTATTATTAACTCTAG 
ATAAGCAGGTACTAGAAACCAAAACTCAGAAAATGTTTACTGTTAGAATTCTATTAAATT 
TTAAGTGTTGTATTCTTTTTCATTGGGTGATGTCAGGGTGATAACCAGACATTCATGGAA 
AGGCATGCAGTTTGTCCATTGTGACAGTTTGTTTAATAAAACCACATACACACTTTATTT 
AAGATTAAAATCTAACTGGAAAGTCAGCTTGGAAAATGGACATTTCCAAGTATGTTTGGT 
GAGTCACAGATATAAAAATAGAAATTCTGATGAGAGGTTTCAGTTTTTAATACCAAGTCC 
TTAGGAGTCTTAACATTGGCCAGCATCTGTTTATCAAATGACATAAATACGTAAACCTAT 
AAGAATTAAGTTTATTAATTAGGCAATTTATGTCTGTGATAATTCTTACGGGAGAAAGAG 
GATTTGATTGGAAAGCAGTTTGGGAAGAAAGTGCTGCTGAAATTTCCAGAATTTAATTGA 
TTGGTTACATAAACTTTTTGACTTCAAT 

SEQ ID NO: 56_AA557536_H 

AGTAAGGCCCCGCGGGCGTCCTGGCCGCCATGTGCACCGTAGTGGACCCTCGCATTGTCC 
GGAGATACCTACTCAGGCGGCAGCTCGGGCAGGGGAGAACATTCCGGGAAATCACGCTCC 
TCCAGGTGAGTGGCCTGGGCCCTCCAGTCCAATCCCCTTGCCCAGGTACAGATCTCTCCA 
GACAGGAGAGAAACTGGCCTTCTTGGGCCCCAGAGCACAGCCCCTCCTGGCCTTCCAGCC 
GCCTCCGACTCTCTCCCCAGGAGTTTGGGGACCATCCCAACATCATCAGCCTCCTTGACG 
TGATCCGGGCAGAGAACGACAGGGACATTTACCTGGTGTTTGAGTTTATGGACACTGACC 
TGAACGCAGTCATCCGGAAGGGCGGCCTGCTGCAGGACGTCCACGTGCGCTCCATCTTCT 
ACCAGCTCCTGCGGGCCACCCGGTTCCTCCACTCGGGGCACGTTGTGCACCGGGACCAGA 
AGCCGTCCAATGTGCTCCTGGATGCCAACTGCACAGTGAAGCTGTGTGACTTTGGCCTGG 
CCCGCTCCCTGGGCGACCTCCCTGAGGGGCCTGAGGACCAGGCCGTGACAGAGTACGTGG 
CCACACGCTGGTACCGAGCACCGGAGGTGCTGCTCTCTTCGCACCGCTACACCGCTTCCT 
GCCCCAGATACACCCTTGGGGTGGACATGTGGAGTCTGGGCTGTATCCTGGGGGAGATGC 
TGCGGGGGAGACCCCTGTTCCCCGGCACGTCCACCCTCCACCAGCTGGAGCTGATCCTGG 
AGACCATCCCACCGCCATCTGAGGAGXXXAGGCCACGACAGACGCTGGATGCCCTCCTAC 
CGCCAGACACCTCCCCAGAGGCCTTGGACCTCCTTAGGCGACTCCTGGTGTTCGCCCCGG 
ACAAGCGGTTAAGCGCGACCCAGGCACTGCAGCACCCCTACGTGCAGAGGTTCCACTGCC 
CCAGCGACGAGTGGGCACGAGAGGCAGATGTGCGGCCCCGGGCACACGAAGGGGTCCAGC 
TCTCTGTGCCTGAGTACCGCAGCCGCGTCTATCAGATGATCCTGGAGTGTGGAGGCAGCA 
GCGGCACCTCGAGAGAGAAGGGCCCGGAGGGTGTCTCCCCAAGCCAGGCACACCTGCACA 
AACCCAGAGCCGACCCTCAGCTGCCTTCTAGGACACCTGTGCAGGGTCCCAGACCCAGGC 
CCCAGAGCAGCCCAGGCCATGACCCTGCCGAGCACGAGTCCCCCCGTGCAGCCAAGAACG 
TTCCCAGGCAGAACTCCGCTCCCCTGCTCCAAACTGCTCTCCTAGGGAATGGGGAAAGGC 
CCCCTGGGGCGAAGGAAGCGCCCCCCTTGACACTCTCGCTGGTGAAGCCAAGCGGGAGGG 
GAGCTGCGCCCTCCCTGACCTCCCAGGCTGCGGCTCAGGTGGCCAACCAGGCCCTGATCC 
GGGGTGACTGGAACCGGGGCGGTGGGGTGAGGGTGGCCAGCGTACAACAGGTCCCTCCCC 
GGCTTCCTCCGGAGGCCCGGCCCGGCCGGAGGATGTTCAGCACCTCTGCCTTGCAGGGTG 
CCCAGGGGGGTGCCAGGGCTTTGCTTGGAGGCTACTCCCAAGCCTACGGGACTGTCTGCC 
ACTCGGCACTGGGCCACCTGCCCCTGCTGGAGGGGCACCATGTGTGAGCCGCCCTACTCC 
CTTCACCTGGCCCTCTGTTCCTGCCCCAGCNCCTTCCCCAGACCCCTCTCCAGTCTCCTG 
CACCCCTTAGCCCTCCCTGCTTTGCCTGGCCCGTTGAAGTTCCAGGGAGCTTGCCCGGGT 

CTCCTCGGGGGAGCAGATGAGGGCCCTGCCC 
SEQ ID NO: 57_N2 8606_H, MOK_H 

ATGAAGAACTATAAAGCAATTGGCAAAATAGGAGAGGGAACGTTTTCTGAAGTTATGAAG 
ATGCAAAGCCTGAGAGATGGAAACTACTATGCATGTAAACAAATGAAGCAGCGCTTTGAA 
AGTATTGAGCAAGTCAACAACCTACGAGAGATCCAAGCACTGAGGCGCCTGAATCCGCAC 
CCAAACATTCTTATGTTGCATGAAGTGGTTTTTGACAGAAAATCTGGTTCTCTTGCACTA 
ATATGTGAACTTATGGACATGAATATTTATGAGCTAATACGAGGGAGAAGATACCCATTA 
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TCAGAAAAAAAAATTATGCACTATATGTACCAGTTATGTAAGTCCCTGGATCATATTCAC 
AGAAATGGAATATTTCACAGAGATGTAAAACCAGAAAATATACTAATAAAGCAGGATGTC 
CTGAAATTAGGGGACTTTGGCTCCTGCCGGAGTGTCTATTCCAAGCAGCCGTACACGGAA 
TACATCTCCACCCGCTGGTACCGGGCCCCGGAGTGTCTCCTCACTGATGGGTTCTACACG 
TACAAGATGGACCTGTGGAGCGCCGGCTGTGTGTTCTACGAGATCGCCAGTCTGCAGCCC 
CTCTTTCCTGGAGTAAATGAACTGGACCAAATCTCAAAAATCCACGATGTCATCGGCACA 
CCCGCTCAGAAGATCCTCACCAAGTTCAAACAGTCGAGAGCTATGAATTTTGATTTTCCT 
TTTAAAAAGGGATCAGGAATACCTCTACTAACAACCAATTTGTCCCCACAATGCCTCTCC 
CTCCTGCACGCAATGGTGGCCTATGATCCCGATGAGAGAATCGCCGCCCACCAGGCCCTG 
CAGCACCCCTACTTCCAAGAACAGAGGAAAACAGAGAAGCGGGCTCTGGGCAGCCACAGA 
AAAGCTGGCTTTCCGGAGCACCCTGTGGCACCGGAACCACTCAGTAACAGCTGCCAGATT 
TCCAAGGAGGGCAGAAAGCAGAAACAGTCCCTAAAGCAAGAGGAGGACCGTCCCAAGAGA 
CGAGGACCGGCCTATGTCATGGAACTGCCCAAACTAAAGCTTTCGGGAGTGGTCAGACTG 
TCGTCTTACTCCAGCCCCACGCTGCAGTCCGTGCTTGGATCTGGAACAAATGGAAGAGTG 
CCGGTGCTGAGACCCTTGAAGTGCATCCCTGCGAGCAAGAAGACAGATCCGCAGAAGGAC 
CTTAAGCCTGCCCCGCAGCAGTGTCGCCTGCCCACCATAGTGCGGAAAGGCGGAAGATAA 

SEQ ID NO: 58_AB023 153_H , ICK_H 

ATGAATAGATACACAACAATCAGGCAGCTCGGGGATGGAACCTACGGTTCCGTCCTGCTG 
GGAAGAAGCATTGAGTCTGGGGAGCTGATCGCTATTAAAAAAATGAAAAGAAAATTTTAT 
TCCTGGGAGGAATGCATGAACCAACGGGAGGTTAAGTCTTTAAAGAAGCTCAACCATGCC 
AATGTAGTCAAATTAAAAGAAGTTATCAGGGAAAATGATCATCTTTATTTTATCTTCGAG 
TACATGAAGGAAAATCTTTACCAGCTCATTAAAGAGAGAAATAAGTTGTTTCCTGAGTCT 
GCTATAAGGAATATCATGTATCAGATATTACAAGGACTCGCATTTATTCACAAACTCGGC 
TTCTTTCATCGAGACTTAAAGCCTGAGAACCTCCTCTGCATGGGACCAGAACTTGTGAAA 
ATTGCAGACTTTGGTTTGGCCCGAGAAATACGATCAAAACCTCCATATACAGATTATGTA 
TCTACCAGATGGTACAGGGCTCCAGAAGTACTCCTGAGGTCTACCAACTACAGCTCCCCC 
ATTGACGTCTGGGCGGTGGGCTGCATCATGGCAGAAGTTTACACCCTCAGGCCACTCTTC 
CCTGGAGCCAGTGAAATTGACACAATATTCAAAATTTGCCAAGTGCTGGGGACACCAAAA 
AAGACTGACTGGCCTGAAGGCTATCAACTTTCAAGTGCAATGAACTTCCGTTGGCCACAG 
TGTGTACCCAATAACTTAAAGACCTTGATTCCCAATGCTAGCAGTGAAGCAGTCGAGCTC 
CTGAGAGACATGCTTCAGTGGGATCCCAAGAAACGACCAACAGCTAGTCAGGCACTTCGA 
TATCCTTACTTCCAAGTTGGACACCCACTAGGCAGCACCACACAAAACCTTCAGGATTCA 
GAAAAACCACAGAAAGGCATCCTGGAAAGGGCAGGCCCACCTCCTTATATTAAGCCAGTC 
CCACCTGCCCAGCCACCAGCCAAGCCACACACACGAATTTCTTCACGACAGCATCAAGCC 
AGCCAGCCCCCTCTGCATCTCACGTACCCCTACAAAGCAGAGGTCTCCAGGACAGATCAC 
CCAAGCCATCTCCAGGAGGACAAGCCAAGCCCGTTGCTTTTCCCATCCCTCCACAACAAG 
CATCCACAGTCGAAAATCACAGCTGGCCTGGAGCACAAAAATGGTGAGATAAAGCCAAAG 
AGTAGGAGAAGGTGGGGTCTTATTTCCAGGTCAACAAAGGATTCAGATGATTGGGCTGAC 
TTGGATGACTTGGATTTCAGTCCATCCCTCAGCAGGATTGACCTGAAAAACAAGAAAAGA 
CAGAGTGATGACACTCTCTGCAGGTTTGAGAGTGTTTTGGACCTGAAGCCCTCTGAGCCT 
GTGGGCACAGGAAACAGTGCCCCCACCCAGACGTCATATCAGCGGCGAGACACGCCCACC 
CTGAGATCTGCAGCCAAGCAGCACTATTTGAAGCACTCTCGATACTTGCCTGGGATCAGT 
ATAAGAAATGGCATACTCTCGAATCCAGGCAAGGAATTTATTCCACCTAATCCATGGTCT 
AGTTCTGGCTTGTCTGGAAAATCTTCAGGGACAATGTCAGTAATCAGCAAAGTAAATTCA 
GTTGGTTCCAGCTCTACAAGTTCTAGTGGACTGACTGGAAACTATGTCCCTTCCTTTCTG 
AAAAAAGAAATCGGTTCTGCTATGCAGAGGGTACACCTAGCACCTATTCCAGACCCTTCC 
CCTGGTTATTCCTCCCTGAAGGCCATGAGACCTCATCCTGGGCGACCATTCTTGGACACC 
CAGCCTAGAAGCACTCCTGGGTTGATACCACGGCCTCCAGCCGCCCAGCCAGTGCATGGC 
CGGACAGACTGGGCTTCCAAGTACCCATCCCGGCGGTGA 
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SEQ ID NO: 5 9_AA83 994 0_M 

AGCAG C AACAATGGTGGCATGAGTG C AG AGG AGG AGATAGGG C C TGGGG CTGAG C CTATG 
AGAGGACCAAGCTTGGCTACAAGGGACTGGAGAGATGAGACTGTTGGGACCACAGACCTG 
CAGCAAGGCATAGACCCAGGAGCAGTGAGCC CTGAG CCTGGGAAGGACCACGCAGCCCAG 
GGCCCAGGAAGAACTGAAGCTGGAAGGGTATCTTCTGCTGCAGAGGCTGCCATTGTGGTT 
CTAGATGACAGCGCAGCACCCCCAGCCCCTTTTGAACACCGGGTAGTGAGCATCAAAGAT 
ACCCTGATCTCAGCAGGCTACACGGTATCCCAACATGAAGTCTTAGGAGGGGGTCGGTTT 
GGCCAGGTGCACAGGTGTACAGAGAGGTCTACAGGCCTTGCACTGGCAGCCAAGATCATC 
AAAGTGAAGAACGTAAAGGACCGGGAGGATGTGAAGAATGAGGTCAACATCATGAACCAG 
CTCAGCCACGTAAACTTGATCCAACTTTATGATGCGTTTGAGAGCAAGAACAGCTTCACT 
CTGATCATGGAGTATGTGGATGGAGGCGAACTCTTTGACCGGATCACGGATGAGAAGTAC 
CACCTCACTGAGTTGGATGTGGTCTTGTTCACGAGGCAGATCTGTGAGGGTGTGCATTAC 
CTGCATCAGCACTATATCCTGCACCTGGACCTCAAGCCTGAGAACATATTGTGTGTCAGC 
CAGACAGGGCATCAAATTAAGATCATTGACTTTGGGCTGGCTAGAAGATACAAGCCTCGG 
GAGAAGCTAAAGGTGAACTTTGGTACTCCGGAGTTCCTGGCCCCAGAAGTTGTTAACTAT 
GAGTTTGTGTCATTTCCAACAGACATGTGGAGTGTGGGAGTTATCACCTACATGCTACTC 
AGTGGTTTGTCCCCATTTCTAGGGGAGACAGATGCAGAGACCATGAATTTTATTGTGAAC 
TGCAGCTGGGATTTCGATGCTGATACCTTCAAAGGGCTGTCGGAGGAAGCCAAGGACTTT 
GTTTCCCGGTTACTGGTCAAAGAGAAGAGCTGTAGGATGAGCGCCACACAGTGCCTGAAA 
CACGAGTGGTTAAATCACCTGCCTGCCAAAGCCTCGGGCTCCAACGTTCGCCTCAGATCC 
CAACAACTGCTGCAGAAATATATGGCTCAGAGTAAATGGAAGAAACATTTCCACGTGGTG 
GCTGCAGTCAACAGGCTACGGAAATTTCCAACGTGTCCCTAATCTTCAACTCTGGTGTTC 
CACTGGGCCTGGGAATTCTTGAGGCAACACGAAGTGGTAATATGAAGAGATTACTCAAGA 
TTTTATGTAGATTGGCGCTTTGCTATTATTGATTTTTCTTATTTTGCAAAGAATGATGGA 
AGGAAGCAAGAAAGAAAGAAAAGAAGAAAAGGGGGAAGAAAAGGAAAAGGCAGAAAGCAA 
GGAAACAGGCTACGTTGTTGCTCTTCTTGTAGGTGAAAGTGTTTTTATTAAAAGCCCTAG 
GAATGTTTTTCTGCCTCGT7^AGGTCAGCAGGTCTCATATGCTGCTTGCTACCCCGCACCC 
TTCCTTTTGGTAATAAGAGCAGGCACGCTCAGGATGGGCAGGGAAATCCTACTTGGCTTT 
TGGTCAAATTTGAATTCTAAACTTGTCATGATTAAAGAAGCCAGTAGGGAGGGAGGTATG 
GAAGAGGGAGGAATTAGGTCCAACAGTGGGGGATGAATTTGACCGAAACATTGTATAAAA 
TTCTTAAAGAATTAATAAAATATATTTTTAAAGGAG 

SEQ ID NO: 6 0_AA4 6 0i32_H 

GGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGTAACCACTTACAGGCCGGAAG 
TGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCAGGATTGCCGAAGAAGTCCCA 
GGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAGAGACAGCTGATCGGTTGGAG 
CTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTACGCCGGCCGATGGCGAGGAG 
CCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGCGGAGCAGCCGCTTCTTGAGC 
GGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGC 
CGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCG 
CGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGCTCCTCCGCTGTCGCCGCGCT 
GGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTTCCAACTGCTTATATATGGAA 
GAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGTCCACTATGGAGACTGAAAAA 
ACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGCAGGTTTTGGCTCGAATGCAC 
GATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACATGCTCCTGAAACCCCCCCTG 
GAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTTTCATTTCAGCACTTCCAGAG 
GATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGTACCCATCCCAACACT 
GAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCTCCTCCAAAAAGGCCAGGCCA 
GTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAAAGAGGTCCATGGTTGGGTAG 
AAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTTTCAAAGTAAATTTGAAGAAA 
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TGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAGATATTTTTAAGTGGTATGTG 
ATCGTGTCATTATCATCTGCACTTCACTCAAGAGCTTACTATGTGTCTAAGTCATGTTCT 
AGGCAGAATTGGGTATTTAAAGTAAATTGAGGACAGGCTTCTCCCAGATTGTGACATGTA 
TATCTCAGATACATGGGTGTGGCATTGAACCACATAATGAGAACATTATTCTCTTTTTAG 
TCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTCGCTGAGCTTACTGGCCCTCT 
AACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTATATTTATTTTGAAACCAGTTT 
AATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATACAGCATGG 

SEQ ID NO: 61_SGK034_H 

CAGAGAGAGAAGGTAAACCAAGGGAACATGCCAGGGCTTCAGAGCACCTTCCTAGCCATG 
GACACGGAGGAGGGGGTAGAGGTGGTGTGGAACGAGCTCCACTTCGGAGACAGGAAGGCC 
TTCGCGGCGCACGAGGAGAAGATCCAGACCGTGTTCGAGCAGCTGGTGCTGGTGGACCAC 
CCGAACATCGTGAAGTTGCACAAGTACTGGCTGGATACCTCTGAGGCCTGCGCGAGGGTC 
ATCTTCATCACAGAGTACGTGTCATCAGGCAGCCTCAAGCAATTCCTCAAAAAGACCAAG 
AAGAACCACAAGGCCATGAACGCCCGGGCCTGGAAGCGCTGGTGCACGCAGATCCTGTCT 
GCGCTCAGCTTCCTGCACGCCTGCAGCCCCCCAATCATCCACGGGAACCTGACCAGCGAC 
ACCATCTTCATTCAGCACAACGGCCTCATCAAGATCGGCTCCGTGTGGCACCGAATCTTC 
TCCAATGCACTTCCAGATGATCTCCGAAGCCCCATCCGCGCTGAGCGAGAGGAACTTCGG 
AACCTGCACTTCTTCCCCCCAGAGTATGGAGAGGTGGCCGATGGGACCGCTGTGGACATC 
TTCTCCTTTGGGATGTGTGCGCTGGAGATGGCTGTACTGGAAATCCAGACCAATGGGGAC 
ACCCGGGTCACAGAGGAGGCCATTGCTCGCGCCAGGCACTCGCTGAGTGACCCCAACATG 
CGGGAGTTCATCCTTTGCTGCCTGGCCCGGGACCCTGCCCGCCGGCCCTCTGCCCACAGC 
CTCCTCTTCCACCGCGTGCTCTTCGAGGTGCACTCGCTGAAGCTCCTGGCAGCCCACTGC 
TTCATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTGGAGGAGAAGACCAAGGCCATG 
GACCTGCACGCGGTCTTGGCGGAGCTTCCCCGGCCCCGCAGGCCCCCGCTGCAGTGGCGG 
TACTCGGAAGTCTCCTTCATGGAGCTGGACAAATTCCTGGAGGATGTCAGGAATGGAATC 
TACCCACTGATGAACTTTGCAGCCACTCGACCCCTGGGGCTGCCCCGTGTGCTGGCCCCA 
CCCCCGGAGGAGGTCCAAAAGGCCAAGACCCCGACGCCAGAGCCCTTTGACTCTGAGACC 
AGAAAGGTCATCCAGATGCAGTGCAACCTGGAGAGAAGCGAGGACAAGGCGCGCTGGCAT 
CTCACTCTGCTTCTGGTGCTGGAAGACCGGCTGCACCGGCAGCTGACCTACGACCTGCTC 
CCAACGGACAGCGCCCAGGACCTCGCCTCGGAGCTCGTGCACTATGGCTTCCTCCACGAG 
GACGACCGGATGAAGCTGGCCGCCTTCCTGGAGAGCACCTTCCTCAAGTACCGTGGGACC 
CAGGCCTGACCCGGAGCCCCAGCCCCAGGGGACCATGCCGGGGTGCTGCCCGGGCAGGCC 
ATGTTGGGGAGACTCCAGCACCGTGGGGCTGCCCTCCTCCATGCGCCTGGGAGCACAAAG 
GCCCCGGTAGTGAAGGAACCCCCCGTCTCCTGAGAGTGGGGCTGACCCTGCCTTGGGCGC 
CGAGGGGTTGGGGGGTGGGTGTGGGGGAGCCGTTAGGCCTCCCAGGTCCTTAGGATCAGG 
GTTGCCCCCAGAACCCCTTCCCATATCCTCCATTCTCCGCCCTGAGTTCCTACCCAGGCT 
GCCTGGCTGGGGCCACTGCCTCCTCAGCATGCAGGAGGCTGCCCTGTAGGGAACCCCAGC 
TCTGGGGCTTGGGGGTGAGGGTCAGCCCTGGACAGACCTCTGCCCAGGGAACTGCTCCAT 
GGGGTCTGGGAGAGCAGCCATCCCCTGCTGGCACCATAGACCCACACAAGGAGCCTGCAC 
AGCAAGCCAGCGGTGACACACCTGCAGGTGTCAGGCATGGCACTGGGCACAACAGGGACC 
TGGCAGGAGAAACAGACCACAGAGAGGTCTGGAGTTGAGGCTGTTGTCAGCAAAGCCCCT 
GGTCCCACACAGCTCTGCCCTAGAGCCACCTCTTTGACCCTTTACCCACCCTGAGACCAG 
AACTTGCAGCCCCTCTGCAGATCTCCTCTGGCCACTGCAGCCCCTCCAATGGGCTTTTTC 
TCTCATGCATTCCCTGGCCTGGAGGCGTCAGGGACCCCACATCCTCCCTGCTCCTCAGAC 
TCACAGCCCCTCCATGTTACCTCCCGCACCTCCTCCCTGGGGCAGCTGCTCCCTGGGCCT 
CTGAGGATGTCAGCTCCTGGCTCCCTGCCTCTCTCCCACTCCACTCCTGGCTCAGTCTTA 
GAGATTTCTATGCCCTCATGGATTCTACCCCTGCCTTCCTGGCCTCTTGATTCTTGGCTT 
GCCTCTCCTCCAATTCCAAACTTAGTGAAATGGCCTTAAGCATTTTAAACTGTATGTATA 
CATTAGCGCATTCATGCCTTTCTAAACGCATTTCAAATGTCAACCAGGAAGGCACACCAC 
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TGTATTAGTTTTATACTGCCGCTGTAAAATTTACCACAAACTTAGTGACTTAACACAAAT 
TTATTGCAATTCTGTAGGCTGGAAGTCTGACTATGGGTCTCACTGGACTAGAATCAAGGC 
TGGCAGGCTGCCTTCCTTCCTGGAGGTTCTAGGGGAGACTCTGTCTCCTGCTCCTTCAGG 
CTGCTGGCAGAATCCACATCCTTTCGGTGGCAGGGCCAAGGTCCCCACTTTCTTGCTGAC 
TGTAAACTAAGGCCACTTCCAGCTTGTAGAGGCTGCCTACATTCCTTGGCTCTTGGCCCC 
CTCCTCCATCTTCAGAGCTAGCAGGTTCAGTCTGTGTCACGAACCATTTCTCTGGTTCCC 
TGCAGACAGGAAAGGTTGTCCCTAAGGACTCATGAGATTAGGTTGGGCCCAGCCAGATAA 
TACATGATAATCTCCCTCCTCAAGGTTTTTAATATTAAACACATCTGCAGGACACATTTT 
GCCATGTAAACTAACATTCACTGGTTCCAGGGATTAAGGAATGAACCTCTTTTGTTGGGG 
AAGGGTGGCATTCTGCTGACCACAGCACTCCAACCAAAAGCCAAAAACCAAAGCAAGACT 
TACTAACGCATATCAAATAAATTAAAGGTACAAAATCGTGAATCTCAGTTATCTTAAATA 
TTCCAATACTATTTACAAAATTATTCAAATTCTCACGCCTTCCAACTCAAAATTAGCAAT 
CTAAAGTAATTTCCATATCCTAGATGGAAACCCTCATGCTAAACTGTCTGATTATGCATG 
GTTCTAAATGGTTTCAGTGGCAAATACATAACATTGTACTACTGATTAAACTGAACTTAA 

AAGC 

SEQ ID NO: 62_AA103218_M SGK034_M 

CCACGCGTCCGCACCAGAGTATGGCGAAGTCAATGATGGGACTGGCTTTGTGGACATCTT 

CTCCTTCGGGATGTGTGCACTGGAGATGGCTGTACTCGAGATCCAAGCCAACGGGGATAC 

CAGAGTCACAGAAGAGGCCATCGCTCGAGCCAGGCACTCACTGAGTGACCCCAACATGCG 

GGAATTCATCCTCTCCTGCCTGGCCCGGGACCCTGCCCGCCGACCCTCAGCCCACAACCT 

CCTCTTCCACCGAGTGCTCTTTGAGGTGCACTCGCTGAAGCTGCTGGCAGCTCACTGCTT 

CATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTAGAGGAAAAGACCAAGGCCATGGA 

CCTCCATGCAGTTTTGGCTGAGATGCCGCAGCCCCATGGACCCCCAATGCAGTGGCGGTA 

CTCAGAGGTCTCCTTCTTGGAGCTGGACAAATTCCTAGAGGATGTCAGGAACGGGATCTA 

TCCACTGATGAACTTTGCGGCTGCTCGGCCCTTGGGGCTTCCCCGTGTGTTGGCCCCACC 

CCCAGAGGAAGCCCAAAAGGCCAAAACTCCAACGCCAGAACCCTTTGACTCGGAGACCAG 

GAAGGTGGTCCAGATGCAGTGCAACCTGGAAAGAAGCGAGGACAAGGCTCGGTGGCACCT 

TACTCTGCTCTTGGTGCTTGAGGACCGGCTACATCGGCAGCTGACCTATGATCTGCTCCC 

AACGGACAGTGCCCAGGACCTCGCTGCTGAACTAGTGCATTATGGCTTCCTGCACGAGGA 

TGACAGGACAAAGCTAGCAGCCTTTCTGGAGACCACTTTTCTCAAGTACCGAGGGACGCA 

AGCGTGACCTTCCCAGTCCTGACGGCCCAGCAGAGATACAGGGGCTCAGGGTTGTCCACT 

TGGCAAAGAGCCCCCACACTGCTCAAAGCTGCCTTCTGCCTGTGTTCCCTGGAACTGAAC 

ACAGGCCCTGCTAGTGAAGACACCCCCACCCCCCAGCTTTCTGCAGCAGTGTGGGACCCT 

GGGGTGGTGATGGAGCCCTGAGCCTGGACGAGAGTGGATACAGGTCAGTTAGGGGAACCG 

CTCCATCTGGTACTAGACAACAGCCATGCCTTCAGGTGGCATAGAAACCTAGGGAAGGAG 

CCTGAACTCAGGTGTCACAGTGCTGGGCATCAGGCAGACCAGACCTGACCTGATTGGAGA 

ACTGTAGACTAGATAGCTTGGAGTTGAACCCATGGCCAGGGAATTCCTTGGTCCTGCTCA 

GACCAGTCCTGATCCCTTGCAGACCTGCCTTGAGCCCTCTTTCTGATCTTCCACACTCTT 

GAGACCAGGACCTGTGTCCTCCCCAAAGCCCTTGGGAAGGATCTTTCTATTCATCATCCC 

TCTGGCCTAGGGGCTCAGGGGTCAGGCATCCTCCACATTCCCTCCCTGGGGAAGTTGTGT 

GTTTGAGTTGAGGATGTGGGTTCCTGGCTCCCTCTTTCTCCCCAGCCCAACTTGTCTCTT 

TCTTACTGGTTTCAAAGTCCTGATGAACGCTTCCCCTCAGAGCCACCCTGGTTTCCTTGG 

TTCTTGAACTGCCTCTCTCCCAACTTCAAACCAGGTCTTAAACGTTTTTTAAATGCATAT 

ATAAATGTAATGCAGTCACGGTCCTTTTTAAACACTTTGTGTATGAAACCAGGAAAGCTC 

ACTATTGTATTAGGAATAGTTCCACATTGCTGCTGTTAACAGATATCATAAACCCAGTGG 

TTTGAGACGACACACACACACACACACACACACACACACAGAGAGAGAGAGAGTTCTGTA 

CATCAAGTGTGATCCAGGCTCTCACTAGATTAATACCCAGGCTAAGTTCCTTTCTGGAAG 

CTGGGACTTACCTCCTGCTCCTTCAAGCTATTGGCAGAACTCACTTCCCTGCAATGGTAA 

GGCAGAAATCCCTATTTTCTCAACAGCTGCCAACTAAGAACCCCTCTCAGCTTCTAGAGG 
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CCACCAACTTTTCTTAGTTCTTCTTTCTCCCCCCTCAAGACCAGCAGCGTCAAGTTGAAT 
CTTTGTCCTGGGCTAGCTGACTGGCTTGCCACTGCTGGGAAGAGTTGGGGCCTTTTGTGA 
GTAGGTTGGACCCACCAGGATAACCGAGGATGATCCCCTTCTCAGGGTCTATAGATGAAC 
CACACCTGCGCAGTTCCTTCTGCTGTCATCCTGGGCTTTGGTGCTTGGAGAACAGCCGTG 
GGCGGTGGGTGTTGTTACTGTGGTACCTACCATGCCATCTTAACCGAAACCAAGACCTAA 
AATAAAACAGATTTGTCATGGGACATCTAATAAATTAAATGAACTCTG 

SEQ ID NO: 63_NEK7_H, N34132_H 

CACGAATCCGAGCCCGCTCGCCTCTCTCCAGCGAACCGACCATGTCTGGCGGCGCCGCAG 

AGAAGCAGAGCAGCACTCCCGGTTCCCTGTTCCTCTCGCCGCCGGCTCCTGCCCCCAAGA 

ACGGCTCCAGCTCCGATTCCTCCGTGGGGGAGAAACTGGGAGCCGCGGCCGCCGACGCTG 

TGACCGGCAGGACCGAGGAGTACAGGCGCCGCCGCCACACTATGGACAAGGACAGCCGTG 

GGGCGGCCGCGACCACTACCACCACTGAGCACCGCTTCTTCCGCCGGAGCGTCATCTGCG 

ACTCCAATGCCACTGCACTGGAGCTTCCCGGCCTTCCTCTTTCCCTGCCCCAGCCCAGCA 

TCCCCGCGGCTGTCCCGCAGAGTGCTCCACCGGAGCCCCACCGGGAAGAGACCGTGACCG 

CCACCGCCACTTCCCAGGTAGCCCAGCAGCCTCCAGCCGCTGCCGCCCCTGGGGAACAGG 

CCGTCGCGGGCCCTGCCCCCTCGACTGTCCCCAGCAGTACCAGCAAAGACCGCCCAGTGT 

CCCAGCCTAGCCTTGTGGGGAGCAAAGAGGAGCCGCCGCCGGCGAGAAGTGGCAGCGGCG 

GCGGCAGCGCCAAGGAGCCACAGGAGGAACGGAGCCAGCAGCAGGATGATATCGAAGAGC 

TGGAGACCAAGGCCGTGGGAATGTCTAACGATGGCCGCTTTCTCAAGTTTGACATCGAAA 

TCGGCAGAGGCTCCTTTAAGACGGTCTACAAAGGTCTGGACACTGAAACCACCGTGGAAG 

TCGCCTGGTGTGAACTGCAGGATCGAAAATTAACAAAGTCTGAGAGGCAGAGATTTAAAG 

AAGAAGCTGAAATGTTAAAAGGTCTTCAGCATCCCAATATTGTTAGATTTTATGATTCCT 

GGGAATCCACAGTAAAAGGAAAGAAGTGCATTGTTTTGGTGACTGAACTTATGACGTCTG 

GAACACTTAAAACGTATCTGAAAAGGTTTAAAGTGATGAAGATCAAAGTTCTAAGAAGCT 

GGTGCCGTCAGATCCTTAAAGGTCTTCAGTTTCTTCATACTCGAACTCCACTTATCATTC 

ACCGCGATCTTAAATGTGACAACATCTTTATCACCGGCCCTACTGGCTCAGTCAAGATTG 

GAGACCTCGGTCTGGCAACCCTGAAGCGGGCTTCTTTTGCCAAGAGTGTGATAGGTACCC 

CAGAGTTCATGGCCCCTGAGATGTATGAGGAGAAATATGATGAATCCGTTGACGTTTATG 

CTTTTGGGATGTGCATGCTTGAGATGGCTACATCTGAATATCCTTACTCGGAGTGCCAAA 

ATGCTGCGCAGATCTACCGTCGCGTGACCAGTGGGGTGAAGCCAGCCAGTTTTGACAAAG 

TAGCAATTCCTGAAGTGAAGGAAATTATTGAAGGATGCATACGACAAAACAAAGATGAAA 

GATATTCCATCAAAGACCTTTTGAACCATGCCTTCTTCCAAGAGGAAACAGGAGTACGGG 

TAGAATTAGCAGAAGAAGATGATGGAGAAAAAATAGCCATAAAATTATGGCTACGTATTG 

AAGATATTAAGAAATTAAAGGGAAAATACAAAGATAATGAAGCTATTGAGTTTTGTTTTG 

ATTTAGAGAGAGATGTCCCAGAAGATGTTGCACAAGAAATGGTAGAGTCTGGGTATGTCT 

GTGAAGGTGATCACAAGACCATGGCTAAAGCTATCAAAGACAGAGTATCATTAATTAAGA 

GGAAACGAGAGCAGCGGCAGTTGGTACGGGAGGAGCAAGAAAACAAAAAGCAGGAAGAGA 

GCAGTCTCAAACAGCAGGTAGAACAATCCAGTGCTTCCCAGACAGGAATCAAGCAGCTCC 

CTTCTGCTAGCACCGGCATACCTACTGCTTCTACCACTTCAGCTTCAGTTTCTACACAAG 

TAGAACCTGAAGAACCTGAGGCAGATCAACATCAACAACTACAGTACCAGCAACCCAGTA 

TATCTGTGTTATCTGATGGGACGGTTGACAGTGGTCAGGGATCCTCTGTCTTCACAGAAT 

CTCGAGTGAGCAGCCAACAGACAGTTTCATATGGGTTCCCAANNCATGAACAGGCACATT 

CTACAGGCACAGTCCCAGGGCATATACCTTCTACTGTCCAAGCACAGTCTCAGCCCCATG 

GGGTATATCCACCCTCAAGTGTGCAGCAGGGAATACAGCAGACAGCCCCTCCTCAACAGA 

CAGTGCAGTATTCACTTTCACAGACATCAACCTCCAGTGAGGCCACTACTGCACAGCCAG 

TGAGTCAGCCTCAAGCTCCACAAGTCTTGCCTCAAGTATCAGCTGGAAAACAGAGTACTC 

AGGGAGTCTCTCAGGTTGCTCCTGCAGAGCCAGTTGCAGTAGCACAGCCCCAAGCTACCC 

AGCCGACCACTTTGGCTTCCTCTGTAGACAGTGCACATTCAGATGTTGCTTCAGGTATGA 

GTGATGGCAATGAGAACGTCCCATCTTCCAGTGGAAGGCATGAAGGAAGAACTACAAAAC 
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GGCATTACCGAAAATCTGTAAGGAGTCGCTCTCGACATGAAAAAACTTCACGCCCAAAAT 

TAAGAATTTTGAATGTTTCAAATAAAGGAGACCGAGTAGTAGAATGTCAATTAGAGACTC 

ATAATAGGAAAATGGTTACATTCAAATTTGACCTAGATGGTGACAACCCCGAGGAGATAG 

CAACAATTATGGTGAACAATGACTTTATTCTAGCAATAGAGAGAGAGTCGTTTGTGGATC 

AAGTGCGAGAAATTATTGAAAAAGCTGATGAAATGCTCAGTGAGGATGTCAGTGTGGAAC 

CAGAGGGTGATCAGGGATTGGAGAGTCTACAAGGAAAGGATGACTATGGCTTTTCAGGTT 

CTCAGAAATTGGAAGGAGAGTTCAAACAACCAATTCCTGCGTCTTCCATGCCACAGCAAA 

TAGGCATTCCTACCAGTTCTTTAACTCAAGTTGTTCATTCTGCGGGAAGGCGGTTTATAG 

TGAGTCCTGTGCCAGAAAGCCGATTACGAGAATCAAAAGTTTTCCCCAGTGAAATAACAG 

ATACAGTTGCTGCCTCTACAGCTCAGAGCCCTGGAATGAACTTGTCTCACTCTGCATCAT 

CCCTTAGTCTACAACAGGCCTTTTCTGAACTTAGACGTGCCCAAATGACAGAAGGACCCA 

ATACAGCACCTCCAAACTTTAGTCATACAGGACCAACATTTCCAGTAGTACCTCCTTTCT 

TAAGTAGCATTGCTGGAGTCCCAACCACAGCAGCAGCCACAGCACCAGTCCCTGCAACAA 

GCAGCCCTCCTAATGACATTTCCACATCAGTAATTCAGTCTGAGGTTACAGTGCCCACTG 

AAGAGGGGATTGCTGGAGTTGCCACCAGCACAGGTGTGGTAACTTCAGGTGGTCTCCCCA 

TACCACCTGTGTCTGAATCACCAGTACTTTCCAGCGTAGTTTCAAGTATCACAATACCTG 

CAGTTGTCTCAATATCTACTACATCCCCGTCACTTCAAGTCCCCACATCCACATCTGAGA 

TCGTTGTTTCTAGTACAGCACTGTATCCTTCAGTAACAGTTTCAGCAACTTCAGCCTCTG 

CAGGGGGCAGTACTGCTACCCCAGGTCCTAAGCCTCCAGCTGTAGTATCTCAGCAGGCAG 

CAGGCAGCACTACTGTGGGAGCCACATTAACATCAGTTTCTACCACCACTTCATTCCCAA 

GCACAGCTTCACAGCTGTCCATTCAGCTTAGCAGCAGTACTTCTACTCCTACTTTAGCTG 

AAACCGTGGTAGTTAGCGCACACTCACTAGATAAGACATCTCATAGCAGTACAACTGGAT 

TGGCTTTCTCCCTCTCTGCACCATCTTCCTCTTCCTCTCCTGGAGCAGGAGTGTCTAGTT 

ATATTTCTCAGCCTGGTGGGCTGCATCCTTTGGTCATTCCATCAGTGATAGCTTCTACTC 

CTATTCTTCCCCAAGCAGCAGGACCTACTTCTACACCTTTATTACCCCAAGTACCTAGTA 

TCCCACCCTTGGTACAGCCTGTTGCCAATGTGCCTGCTGTACAGCAGACACTAATTCATA 

GTCAGCCTCAACCAGCTTTGCTTCCCAACCAGCCCCATACTCATTGTCCTGAAGTAGATT 

CTGATACACAACCCAAAGCTCCTGGAATTGATGACATAAAGACTCTAGAAGAAAAGCTGC 

GGTCTCTGTTCAGTGAACACAGCTCATCTGGAGCTCAGCATGCCTCTGTCTCACTGGAGA 

CCTCACTAGTCATAGAGAGCACTGTCACACCAGGCATCCCAACTACTGCTGTTGCACCAA 

GCAAACTCCTGACTTCTACCACAAGTACTTGCTTACCACCAACCAATTTACCACTAGGAA 

CAGTTGCTTTGCCAGTTACACCAGTGGTCACACCTGGGCAAGTTTCTACCCCAGTCAGCA 

CTACTACATCAGGAGTGAAACCTGGAACTGCTCCCTCCAAGCCACCTCTAACTAAGGCTC 

CGGTGCTGCCAGTGGGTACTGAACTTCCAGCAGGTACTCTACCCAGCGAGCAGCTGCCAC 
GGACCTTCTCTAACCCAGTCCCAGCAACCTCTAGAGGATCTTGATGCTCAAT 

TGAGAAGAACACTTAGTCCAGAGATGATCACAGTGACTTCTGCGGTTGGTCCTGTGTCCA 
TGGCGGCTCCAACAGCAATCACAGAAGCAGGAACACAGCCTCAGAAGGGTGTTTCTCAAG 
TCAAAGAAGGCCCTGTCCTAGCAACTAGTTCAGGAGCTGGTGTTTTTAAGATGGGACGAT 
TTCAGGTTTCTGTTGCAGCAGACGGTGCCCAGAAAGAGGGTAAAAATAAGTCAGAAGATG 
CAAAGTCTGTTCATTTTGAATCCAGCACCTCAGAGTCCTCAGTGCTATCAAGTAGTAGTC 
CAGAGAGTACCTTGGTGAAACCAGAGCCGAATGGCATAACCATCCCTGGTATCTCTTCAG 
ATGTGCCAGAGAGTGCCCACAAAACTACTGCCTCAGAGGCAAAGTCAGACACTGGGCAGC 
CTACCAAGGTTGGACGTTTTCAGGTGACAACTACAGCAAACAAAGTGGGTCGTTTCTCTG 
TATCAAAAACTGAGGACAAGATCACTGACACAAAGAAAGAAGGACCAGTGGCATCTCCTC 
CTTTTATGGATTTGGAACAAGCTGTTCTTCCTGCTGTGATACCAAAGAAAGAGAAGCCTG 
AACTGTCAGAGCCTTCACATCTAAATGGGCCGTCTTCTGACCCGGAGGCCGCTTTTTTAA 
CTAGGGATGTGGATGATGGTTCCGGTAGTCCACACTCGCCCCATCAGCTGAGCTCAAAGA 

SotcctLccagaatctaagtcaaagccttagtaattcatttaa 

GTAGCGACAATGAGTCAGATATCGAAGATGAAGACTTAAAGTTAGAGCTGCGACGACTAC 

gIgataaacatctcaaagagattcaggacctgcagagtcgccagaagcatgaaattgaat 
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CTTTGTATACCAAACTGGGCAAGGTGCCCCCTGCTGTTATTATTCCCCCAGCTGCTCCCC 

TTTCAGGGAGAAGACGACGACCCACTAAAAGCAAAGGCAGCAAATCTAGTCGAAGCAGTT 

CCTTGGGGAATAAAAGCCCCCAGCTTTCAGGTAACCTGTCTGGTCAGAGTGCAGCTTCAG 

TCTTGCACCCCCAGCAGACCCTCCACCCTCCTGGCAACATCCCAGAGTCCGGGCAGAATC 

AGCTGTTACAGCCCCTTAAGCCATCTCCCTCCAGTGACAACCTCTATTCAGCCTTCACCA 

GTGATGGTGCCATTTCAGTACCAAGCCTTTCTGCTCCAGGTCAAGGTAATAAAGCAACCA 

TCATCGTCCAAAAACAATAAAATGGAGATGTTGCCATACCTGGGACAAAAGCCTGTTAAG 

GCGGGTTGGGAGACTAGCTGACCAGAACACAGCCTGTGTGTTGTACACTGAAGAATCTGG 

GTGAAAAGGGAAGTGGAGTGATAATGAGAATCGGTGGGCTCACTGCTCCCATTAGGTGAA 

ATTACTTTTTTTCAAGGAATTACAGTGAAAAGTTACATCTGTGTGGCCTATATGACTTGC 

TCATTTGGGATTTGGAACTTAGGCTTTAATATTAGGCTGAGATTTCCTGGATGAAATTCT 

AAGGTGTTTTAGCAGTTTCTGAAGCTAATACATTTTCTTAGCCATTGTAGAATTTTGTTA 

CTTTTAAGTATGGGAGTGGCATACTAAAATGAATAACCTTACAATTCAGTTTTTTATCCA 

TAATCTACTTTCCAAATATAGCTCTGTTTATTAGTGATTGCTGAAAAAATTCCCACAGAG 

GAAAGAGCTTTTAGTCATATTAGAACAAGAATTGAAAAGACTTGGGCATCTGGGTGAGAA 

GAATGAAAAAAATATAGGTACTGGCTTATGTGCCTTTGCCACAGTTTCACAGAAATTAGA 

GATCAGTCTCTTCACAGGAAGAATGCACTTGATTGGTAAGGAGGGCAAACTAGCTAGCAT 

TATTCGAACTAAGAAAAGCTTCCGCATTTTGCAGATGGGTAGAATTAAGACCTAATATTT 

CATCTCTTACATATCTGACCTTCCCCCCAGAAGCTTGTTCTTCTGTGTGCCATCTTAGTG 

CATTTCACCACTCCAGCCTCAAGTTTCTAACATCTTGTAGTTGTGTTCTGTCTCTTCTCC 

TCTCTCTGTTCTACCCTGTTTTTCCCCTCTCACAGGCTGTGCGAAGTTTAACTGTGCATC 

TGAACAGGTGACATTCAAACCTGGTGGCAGGAGGACCCGATTTCTGAGTACGCCCTGCTT 

GGCTCTTTGTGTGTAACACCTTTACTCCTTCCTTGTCCTTGTGTTTCTGCTGCTTGGATC 

TGATGTTTCACGCAGTCCATTTTCATTTGTCTCTTTTTGTATATCATCTACTCAGTGGCT 

TGGCTGAATTACTGTTACCCTCAGAAGTTTGGGCCCCCACATTAATTATGATAAAAAATG 

TCAAAATAACAAGTTATCTACAAATTTCAATGTAACTTTCTGGTAGAAGTGCTTCTTCAT 

GGATCTGTGACAGAGAGTGGATATGGTATCTAGGCAATAGATTGCTGGGTCATTTAGAAT 

AATGAAGACTGAACTCCACAGTCGTAGTCAGTGCTGTCTGTCTGCCCTAGCATTAGAAAT 

GAGAGAAATCAGCCAGACACGGTGGCGTACACCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGCGGGAAGATTGCTTGAGGCCAGGAGCTCGAGACCAACCCTGGGCAACATGGTGATACC 

CCATCTCT 

SEQ ID NO: 64_BCON3_H 

GCGGAGCGCAGCTGTGAGGGAGTCGCTGTGATCCGGGGCCCCGGAACCCGAGCTGGAGCT 
GAAGCGCAGGCTGCGGGGCGCGGAGTCGGGAGGCCTGAGTGTTCCTTCCAGCATGTCGGA 
GGGGGAGTCCCAGACAGTACTTAGCAGTGGCTCAGACCCAAAGGTAGAATCCTCATCTTC 
AGCTCCTGGCCTGACATCAGTGTCACCTCCTGTGACCTCCACAACCTCAGCTGCTTCCCC 
AGAGGAAGAAGAAGAAAGTGAAGATGAGTCTGAGATTTTGGAAGAGTCGCCCTGTGGGCG 
CTGGCAGAAGAGGCGAGAAGAGGTGAATCAACGGAATGTACCAGGTATTGACAGTGCATA 
CCTGGCCATGGATACAGAGGAAGGTGTAGAGGTTGTGTGGAATGAGGTACAGTTCTCTGA 
ACGCAAGAACTACAAGCTGCAGGAGGAAAAGGTTCGTGCTGTGTTTGATAATCTGATTCA 
ATTGGAGCATCTTAACATTGTTAAGTTTCACAAATATTGGGCTGACATTAAAGAGAACAA 
GGCCAGGGTCATTTTTATCACAGAATACATGTCATCTGGGAGTCTGAAGCAATTTCTGAA 
GAAGACCAAAAAGAACCACAAGACGATGAATGAAAAGGCATGGAAGCGTTGGTGCACACA 
AATCCTCTCTGCCCTAAGCTACCTGCACTCCTGTGACCCCCCCATCATCCATGGGAACCT 
GACCTGTGACACCATCTTCATCCAGCACAACGGACTCATCAAGATTGGCTCTGTGGCTCC 
TGACACTATCAACAATCATGTGAAGACTTGTCGAGAAGAGCAGAAGAATCTACACTTCTT 
TGCACCAGAGTATGGAGAAGTCACTAATGTGACAACAGCAGTGGACATCTACTCCTTTGG 
CATGTGTGCACTGGAGATGGCAGTGCTGGAGATTCAGGGCAATGGAGAGTCCTCATATGT 
GCCACAGGAAGCCATCAGCAGTGCCATCCAGCTTCTAGAAGACCCATTACAGAGGGAGTT 
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CATTCAAAAGTGC^ 

CCACCCAGCATTGTTTGAAGTGCCC^ 

ACACCAACACATGATCCCAGAGAACGCTCiAGA^^ 

TGCCGTACTGGCTGAAATCCCTGCAGGACCAG^ 

TCAGTCACCAGCTCTGGAATTAGATAA^ 

TCTGACAGCCTTTGGGCTGCCTCGGCCC^ 

TGTCGTGCCCCCCTCTGTCAAGACTCCGACAC^ 

GGTGGTGCTGATGCAGTGCAACATTGAGTCGGTGGAGGA 

ACTTCTGCTGAAGTTGGAGGACAAACTG^ 

TGAGAATATCCCCGAGTTGGCGGCTGA^CTGCT^^^ 

CCAGAGCCGGTTGACTTCTCTGCTAGAAGAGAC^ 

CAGTACCCTCAACTCAGCCGCTGTCACCGTC^ 

TGATCTGCGCTGTGGCTGTCCCTGGACGTGCTGCA 

AGTATTACCCTGTGAAGCCCCTTCCCTCCTTTA11A GCA CTTTGTTTACT 
TGGTTCTGAGCATCATCCTTTCCCCTCCCCTCTCTTCCT 

TGTTTTGCACAGACGTGGGCCTGGGCCTTCTCA^ 
CGCTGATCTGCCGGCTCCCGCCCAGCCTGTGTGGAAAGG^A 

GCT 

CTTAAGCAGTTTCTGAAGAA 

aaacgctggtgtacacagatcctct^ 
aagIcctacacttttttgca^ 

GACATCTACTCCTTTGGCATGTG^ 
GGCGAGTCCTCATATGTGCCACAGGAAGCCAT^ 

tcattacagagggagtttattcaaa^gtgcctg^^ 

GCCAGAGAACTTCTGTTCCACCCAGCAC^ 

GCTCACTGTATCGTGGGG^ 
AAGAACATGGATACCAGTGCTGTACTAGCTGAAA 

GTTCAGACTTTGTACTCTCAGTCACCAGCCCT 
AGGAATGGGATCTACCCTCTGACAGCCTTTGGGCT^ 

GAGGAGGTGACATCACCTGTTGTGCCCCCCTC^ 

GAAGTGGAGACACGAAAGGTGGTGCTGATGCAGT^ 

GTCAAACACCATCTAACACTTCTGCTGAAGCTGG^ 

TGTGACCTGATGCCAAATGAGAGCATC^ 

TTCATTAGTGAGGCTGATCAGAGCCGCCT^ 

TTCAACTTCACCAGGAACAGTACACTCAACACAGCCA^ 

TCACTTGAGCCAGGCCCCTAGCCAGGCTGTGGOT 

GTCCCTTCTCCCCAGTCAGTATTACCCTTCG^ 

GCTCCCTGGTTGAGTATCACCCTGCCCCTTCCCCT^ 

TACTTGTTTTGCACAGACGTGGGCCTGGGCCTTC^ 

TAGTAGCTGACCTGCTGCCTCCTGCCCTAC^^ 

SgI?" 

TTTTTGCT 
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SEQ ID NO: 66 AA099102_H 

ATGTCATCATGTGTCTCTAGCCAGCCCAGCAGCAACCGGGCCGCCCCCCAGGATGAGCTG 

GGGGGCAGGGGCAGCAGCAGCAGCGAAAGCCAGAAGCCCTGTGAGGCCCTGCGGGGCCTC 

TCATCCTTGAGCATCCACCTGGGCATGGAGTCCTTCATTGTGGTCACCGAGTGTGAGCCG 

GGCTGTGCTGTGGACCTCGGCTTGGCGCGGGACCGGCCCCTGGAGGCCGATGGCCAAGAG 

GTCCCCCTTGACACCTCCGGGTCCCAGGCCCGGCCCCACCTCTCCGGTCGCAAGCTGTCT 

CTGCAAGAGCGGTCCCAGGGTGGGCTGGCAGCCGGTGGCAGCCTGGACATGAACGGACGC 

TGCATCTGCCCGTCCCTGCCCTACTCACCCGTCAGCTCCCCGCAGTCCTCGCCTCGGCTG 

CCCCGGCGGCCGACAGTGGAGTCTCACCACGTCTCCATCACGGGTATGCAGGACTGTGTG 

CAGCTGAATCAGTATACCCTGAAGGATGAAATTGGAAAGGGCTCCTATGGTGTCGTCAAG 

TTGGCCTACAATGAAAATGACAATACCTACTATGCAATGAAGGTGCTGTCCAAAAAGAAG 

CTGATCCGGCAGGCCGCTTTTCCACGTCGCCCTCCACCCCGAGGCACCCGGCCAGCTCCT 

GGAGGCTGCATCCAGCCCAGGGGCCCCATTGAGCAGGTGTACCAGGAAATTGCCATCCTC 

AAGAAGCTGGACCACCCCAATGTGGTGAAGCTGGTGGAGGTCCTGGATGACCCCAATGAG 

GACCATCTGTACATGGTGTTCGAACTGGTCAACCAAGGGCCCGTGATGGAAGTGCCCACC 

CTCAAACCACTCTCTGAAGACCAGGCCCGTTTCTACTTCCAGGATCTGATCAAAGGCATC 

GAGTACTTACACTACCAGAAGATCATCCACCGTGACATCAAACCTTCCAACCTCCTGGTC 

GGAGAAGATGGGCACATCAAGATCGCTGACTTTGGTGTGAGCAATGAATTCAAGGGCAGT 

GACGCGCTCCTCTCCAACTACGTGGGCACGCCCGCCTTCATGGCTCCCGAGTCGCTCTCT 

GAGACCCGCAAGATCTTCTCTGGGAAGGCCAAGGATGTTTGGGCCATGGGTGTGACACTA 

TACTGCTTTGTCTTTGGCCAGTGCCCATTCATGGACGAGCGGATCATGTGTTTACACAGT 

AAGATCAAGAGTCAGGCCCTGGAATTTCCAGACCAGCCCGACATAGCTGAGGACTTGAAG 

GACCTGATCACCCGTATGCTGGACAAGAACCCCGAGTCGAGGATCGTGGTGCCGGAAATC 

AAGCTGCACCCCTGGGTCACGAGGCATGGGGCGGAGCCGTTGCCGTCGGAGGATGAGAAC 

TGCACGCTGGTCGAAGTGACTGAAGAGGAGGTCGAGAACTCAGTCAAACACATTCCCAGC 

TTGGCAACCGTGATCCTGGTGAAGACCATGATACGTAAACGCTCCTTTGGGAACCCATTC 

GAGGGCAGCCGGCGGGAGGAACGCTCACTGTCAGCGCCTGGAAACTTGCTCACCAAAAAA 

CCAACCAGGGAATGTGAGTCCCTGTCTGAGCTCAAGGAAGCAAGGCAGCGAAGACAACCT 

CCAGGGCACCGACCCGCCCCCCGTGGGGGAGGAGGAAGTGCTCTTGTGAGAGGCAGTCCC 

TGCGTGGAAAGTTGCTGGGCCCCCGCCCCCGGCTCCCCCGCACGCATGCATCCACTGCGG 

CCGGAGGAGGCCATGGAGCCCGAGTAG 
SEO ID NO: 67 5R69 17 2 H 

CCGGGATGTGAGCCTGGTGGTTC^CAGCTGGAGCCACGTCGGAGGGGGAAGTGTCGCAGC 

ATTCTCTGCAGGCATCACAGACCTGAGGCAGTGGCCTCCGGAGGGCACTGGACAGAAACA 

GCCATCCAAGTGGCTGAGTGGAGGGACCCTGCTCAAGTGCAGCTGCAGTGGCCGGGGTTT 

CCCTCAGGTAGGGATCGGGGCGCCTTGTCGCCGCCAGCCACGTGTGGCGTCCGCT 

CAGCAGAGTGCAGGGTGCGGGCACCAGGAAAGGGGGCGCAGGGGAACTCCCGCGGGCCTC 

GCGTTTGCAAACTTCTCGCCTGGGCAGGAGGCGGTCGTGGGAAAGAAGGTGGAAGAGCGA 

GCTTTTTGGAACTGTGCACGGGACAGATTGGACGCACACCCCTCGGGAGGCGCGAAGGCA 

TGGAAAATTTGAAGCATATTATCACCCTTGGCCAGGTCATCCACAAACGGTGTGAAGAGA 

TGAAATACTGCAAGAAACAGTGCCGGCGCCTGGGCCACCGCGTCCTCGGCCTGATCAAGC 

CTCTGGAGATGCTCCAGGACCAAGGAAAGAGGAGCGTGCCCTCTGAGAAGTTAACCACAG 

rCATGAACCGCTTCAAGGCTGCCCTGGAGGAGGCTAATGGGGAGATAGAAAAGTTCAGCA 

ATAGATCCAATATCTGCAGGTTTCTAACAGCAAGCCAGGACAAAATACTCTTCAAGGACG 

TGAACAGGAAGCTGAGTGATGTCTGGAAGGAGCTCTCGCTGTTACTTCAGGTTGAGCAAC 

GCATGCCTGTTTCACCCATAAGCCAAGGAGCGTCCTGGGCACAGGAAGATCAGCAGGATG 

CAGACGAAGACAGGCGAGCTTTCCAGATGCTAAGAAGAGATAATGAAAAAATAGAAGCTT 

CACTGAGACGATTAGAAATCAACATGAAAGAAATCAAGGAAACTTTGAGGCAGTGTAAGT 

TATCATGTGCCCTGCTGTTTCTGATGGCCCCCAAACTAGAAGTCATCAGTTTACTGGGAC 
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CCCAGCCTCCCGCTACCCCTGCATTTGTCCATTTTCTGTGCTGGATGGCTGGAAGCAGCC 

CACAGGTTTGGGGATCCATTCATGGCTAGCCCAGGCTTCTGTCCATGGAATAACATGTGG 

AGAGAGCTTCTTGACCAGTAAGATACCTTCTAGCAGCTGTCAAAGTACTTAAAAACCTCT 

ATGAATAGAATCAAAGCTTCAGTTCAGTTGCTGAATTTCCAAGAAGAAATTCAAATCAAA 

TTTAAAATGCCCACTCATTCATTCATTCAACAAAACTGTGAGTATCTGGTTTATGCCAGA 

GGCCATGCAAAGAGGTAACTAAGATGCAGAGAAGGACACTGCCTTCCAGGAGCTCACGGG 

GTGGAGGAGGAAAGAGGAAAGACAGACAGTGAACACACAACAGCAAGGTTACTGAGCTTG 

AACTATGTCCCTAACTACTAGATCTGAAATGACTACGCCAGATGCCAGATGCTCAAGTGC 

CAAGCTCTGGGTAACAGGAATAGACATCCTTCCAGGATGAGAGAGATGAGTCTGGATGAG 

GGTTAAGGCTGGAGGGACAGGCGGGATTTGAAGAGGAGGGAAAGGAAGTGGATGACACAT 

TCTGTTAACTGTCCAGCTGTGTCTCTACTGGTCACTCAGAGGCACGGGAGCCGCTCCCTT 

GGGCTGAGTCCATCAGAAGCCCCAGCCACCACCAGCTCTGGTTCATGTAGTAGAGCTTCC 

CACTCACACATCACAAATATGCCACCTCCCTTAGGACCCCTTCCTCTGCTCATTGACTCT 

TTTGTCTTCTTTCCTCTCGGGGGTGAGGTCAGATTTACCACCAAAATGCATGCAGGAGAT 

rCCGCAAGAGCAAATCAAGGAGATCAAGAAGGAGCAGCTTTCAGGATCCCCGTGGATTCT 

GCTAAGGGAAAATGAAGTCAGCACACTTTATAAAGGAGAATACCACAGAGCTCCAGTGGC 

CATAAAAGTATTCAAAAAACTCCAGGCTGGCAGCATTGCAATAGTGAGGCAGACTTTCAA 

TAAGGAGATCAAAACCATGAAGAAATTCGAATCTCCCAACATCCTGCGTATATTTGGGAT 

TTGCATTGATGAAACAGTGACTCCGCCTCAATTCTCCATTGTCATGGAGTACTGTGAACT 

rrrrACCCTGAGGGAGCTGTTGGATAGGGAAAAAGACCTCACACTTGGCAAGCGCATGGT 

CCTAGTCCTGGGGGCAGCCCGAGGCCTATACCGGCTACACCATTCAGAAGCACCTGAACT 

CCACGGAAAAATCAGAAGCTCAAACTTCCTGGTAACTCAAGGCTACCAAGTGAAGCTTGC 

ArCATTTGAGTTGAGGAAAACACAGACTTCCATGAGTTTGGGAACTACGAGAGAAAAGAC 

AGACAGAGTCAAATCTACAGCATATCTCTCACCTCAGGAACTGGAAGATGTATTTTATCA 

ATATGATGTAAAGTCTGAAATATACAGCTTTGGAATCGTCCTCTGGGAAATCGCCACTGG 

ACATATCCCGTTTCAAGGTGAAGAATGTGAAGACTGGCTCAGCCAGTGGCTGTAATTCTG 

ACAAGATCCGCAAGCTGGTGGCTGTGAAGCGGCAGCAGGAGCCACTGGGTGAAGACTGCC 

CTTCAGAGCTGCGGGAGATCATTGATGAGTGCCGGGCCCATGATCCCTCTGTGCGGCCCT 

CTGTGGATGAAATCTTAAAGAAACTCTGCACCTTTTCTAAGTAGTGTATCAAAATCTAAA 

CCAAGGAGTCTCTGGACAAGAAGCTGGGAGAGGCACGAACTGGACATCTCTCTCTCTCAT 

ATCCTTCGGCATTGGGTTATCTATGGGTGCAAGGAGTGGGCACGCTTCTCTGTTACAAAT 

AGAAAACGATTCCAGTCATACAGGACACATCCCACTCCAAATGATATTTCCAAAAACATA 

CCTCTGACAGTAACTTTGATAGATGGTTTGTCAAATGTATCTTTCTGGGTATCCACACCT 

CTTGGCAATGAAATTTGCAGCTCCTCCCTTCCATAAATGAAGTCTCTTTCCCCACCATTT 

rAATCTGGGCTGGCACTGTGACTTGATTTGATCAATAGAATGTGGAAGAAGTGACTGTAT 

CCCAGTTCCAAGCCTAGGTTTCAAGAGGCCTTATAAATGTCTGTTGGAACCTTACCCAGC 

PATGGACATGTTGAGTGAGCATGCTGGAGAATGAGAGACCACATGAAGCAGAAACATGCT 

TTCCTAGCTGAAGTCATACTAGCCCAACCAACATGGCAGCTAACACATGAATGAGGCCAA 

TCAAGACCAGAAGAACCACTCAAGCAGATCCCAGCCCAAATTGCCCATTCACACAATCAG 

GAGCTAAATAAATTACTGTTGTCTTTT 
qFO ID NO: 68 H85811 H 

CGCCCGGCCCCCTCCCCCGGCGCCGGCCACGGGAGGCGGTGATGCGGGCGCGGGCGGCCT 
CGGCTGCGCCGAGAGCGGAGACACAGGCTCAAGATGGCAGATTCCGACTGAGGCTGGGGG 
rcCCGAGCTCGCGCGCCGCTTTCCCGTCCCCGTTGCCATGAACCGCGGACACCCCGGCCC 
CGATGGCCCCCGTGTACGAAGGTATGGCCTCACATGTGCAAGTTTTCTCCCCTCACACCC 
TTrAATCAAGTGCCTTCTGTAGTGTGAAGAAACTGAAAATAGAGCCGAGTTCCAACTGGG 
ACATGACTGGGTACGGCTCCCACAGCAAAGTGTATAGCCAGAGCAAGAACATCCCCCTGT 
CGCAGCCAGCCACCACAACCGTCAGCACCTCCTTGCCGGTCCCAAACCCAAGCCTACCT^ 
ACGAGCAGACCATCGTCTTCCCAGGAAGCACCGGGCACATCGTGGTCACCTCAGCAAGCA 
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FIGURE 2DDD 

GCACTTCTGTCACCGGGCAAGTCCTCGGCGGACCACACAACCTAATGCGTCGAAGCACTG 
TrAGCCTCCTTGATACCTACCAAAAATGTGGACTCAAGCGTAAGAGCGAGGAGATCGAGA 
ACACAAGCAGCGTGCAGATCATCGAGGAGCATCCACCCATGATTCAGAATAATGCAAGCG 
GGGCCACTGTCGCCACTGCCACCACGTCTACTGCCACCTCCAAAAACAGCGGCTCCAACA 
GCGAGGGCGACTATCAGCTGGTGCAGCATGAGGTACTGTGCTCCATGACCAACACCTACG 
AGGTCTTAGAGTTCTTGGGCCGAGGGACGTTTGGGCAAGTGGTCAAGTGCTGGAAACGGG 
GCACCAATGAGATCGTAGCCATCAAGATCCTGAAGAACCACCCATCCTATGCCCGACAAG 
GTCAGATTGAAGTGAGCATCCTGGCCCGGTTGAGCACGGAGAGTGCCGATGACTATAACT 
TCGTCCGGGCCTACGAATGCTTCCAGCACAAGAACCACACGTGCTTGGTCTTCGAGATGT 
TGGAGCAGAACCTCTATGACTTTCTGAAGCAAAACAAGTTTAGCCCCTTGCCCCTCAAAT 

ACA^CGCCCAGTTCTCCAGCAGGTAGCCAC 

TTATCCACGCTGACCTCAAACCAGAGAACATCATGCTGGTGGATCCATCTAGACAACCAT 
ACAGAGTCAAGGTCATCGACTTTGGTTCAGCCAGCCACGTCTCCAAGGCTGTGTGCTCCA 

cSacttgcagtccagatattacagggcccctgagatcatccttggtttaccattttgtg 

AGGCAATTGACATGTGGTCCCTGGGCTGTGTTATTGCAGAATTGTTCCTGGGTTGGCCGT 

?a?a^Sggagattcggagtatgatcagattcggtatatttcacaaacac^^ 

Sgctgaatatttattaagcgccgggacaaagacaactaggtttttcaaccgtgaca^ 

actcaccatatcctttgtggagactgaagacaccagatgaccatgaagcagagacaggga 

?taagtcaaaagaagcaagaaagtacattttcaactgtttagatgatatggcccaggtga 

Icatgacgacagatttggaagggagcgacatgttggtagaaaaggctgaccggcgggagt 

tcatoacctgttgaagaagatgctgaccattgatgctgacaagagaatcactccaatcg 

aaaccctgaaccatccctttgtcaccatgacacacttactcgattttccccacagcacac 

a^?caaatcatgtttccagaacatggagatctgcaagcgtcgggtgaatatgtatgaca 

cSgtgaaccagagcaaaacccctttcatcacgcacgtggcccccagcacgtccaccaacc 

?gacca^gacctttaacaaccagctgaccactgtccacaaccagccctcagcggcatcca 

tcgctgcLtggcccagcggagcatgcccctgcagacaggaacagcccagatttgtgccc 

ggcctgacccgttccagcaagctctcatcgtgtgtccccccggcttccaaggcttgcagg 

SctccSctaagcacgctggctactcggtgcgaatggaaaatgcagttcccatcgtca 

ctcaagccccaggagctcagcctcttcagatccaaccaggtctgcttgcccagcaggctt 

ggccaagtgggacccagcagatcctgcttcccccagcatggcagcaactgactggagtgg 

CCACCCACACCT^ 

AGCTGGCGGACTGGAGAAATACGCATGCTCACGGAAGCCATTATAATCCCATCATG^ 

agcc?gcactattgaccggtcatgtgacccttccagcagcacagcccttaaatgtgggtg 

TGGCCCACGTGATGCGGCAGCAGCCAACCAGCACCACCTCCTCCCGGAAGAGTAAGCAGC 

Iccaotcatctgtgagaaatgtctccacctgtgaggtgtcctcctctcaggccatcagct 

CCCCACAGCGATCCAAGCGTGTCAAGGAGAACACACCTCCCCGCTGTGCCATGGTGCACA 

Sagcccggcctgcagcacctcggtcacctgtgggtggggcgacgtggcctccagcacca 

cccgggScggcLcggcagacaattgtcattcccgacactcccagccccacggt^ 

?ca?Sc^atcagcagtgacacggacgaggaggaggaacagaaacacgcccccacca^ 

cS?ctccaagcaaagaaaaaacgtcatcagctgtgtcacagtccacgactccccctact 

ccgactcotcagcaacaccagcccctactccgtgcagcagcgtgctgggcacaacaatg 

ccSSSttgacaccaaggggagcctggagaatcactgcacggggaacccccgaacca 

tca^cgtgccacccctgaaaacccaggccagcgaagtattggtggagtgtgatagcctc 

tgccagtcaacaccagtcaccactcgtcctcctacaagtccaagtcctccagcaacgtg^ 

gScgggcccccacttccagcagcagcagccactcaatctcagccaggctcagcagcaca 
?cacca?ggaccgcactgggagccaccgaaggcagcaggcctacatcactcccacc^^ 
ScaggSccgtactccttcccgcacaacagccccagccacggcactgtgcacccgcat^ 
^Sctgcagccgctgccgctgcccacctccccacccagccccacctctacacctacactg 

cgccggcgg^cSS 
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FIGURE 2EEE 

CGCGCCACACCGTGCAGCACACTGCCTACCCAGCCAGCATCGTCCAC^GGTCCCCG^A 
GCATGGGCCCCCGGGTCCTGCCCTCGCCCACCATCCACCCGAGTCAGTATCCAGCCCAAT 
TTGCCCACCAGACCTACATCAGCGCCTCGCCAGCCTCCACCGTCTACACTGGATACCCAC 
TGAGCCCCGCCAAGGTCAACCAGTACCCTTACATATAAACACTGGAGGGGAGGGAGGGAG 
GGAGGGAGGGAGAGAATGGCCCGAGGGAGGAGGGAGAGAAGGAGGGAGGCGCTCCTGGGA 
CCGTGGGCGCTGGCCTTTTATACTGAAGATGCCGCACACAAACAATGCAAACGGGGCAGG 
GGCGGGGGGGGGGGGGCAGAGGGCAGGGGGACGGGTCGGGACACCAGTGAAACTTGAACC 
GGGAAGTGGGAGGACGTAGAGCAGAGAAGAGAACATTTTTAAAAGGAAGGGATTAAAGAG 

GGTGGGAAATCTATGGTTTTTATTTTAAAAAAG 

CGGGAGCGAAAGTGCGCTGAGCTGCAGTGTCTGGTCGAGAGTACCCGTG 

cSSgag^ 

CTCCCCGAGCGGGGTCCCGAGCTAGGAGATGGGAGGCACAGCTCGTGGGCCTGGGCG^ 
rGATGCGGGGCCGCCTGGGGCCGGGCTCCCGCCCCAGCAGCGGAGTTGGGGGATGGTGTC 

?a?gaS?S?catgatgatagatgaaaccaaatgt 

AATC^^CTGAACCACCTCCACCCAGAAGACTAAATATGACCGCTGAGCAGTTTACAGGA 
GATCATACTCAGCACTTTTTGGATGGAGGTGAGATGAAGGTAGAACAGCTGTTTCAAGAA 
TTTGGCAACAGAAAATCCAATACTATTCAGTCAGATGGCATCAGTGACTCTGAAAAATG^ 
TCTCCTACTGTTTCTCAGGGTAAAAGTTCAGATTGCTTGAATACAGTAAAATCCAACAGT 
TCATCCAAGGCACCCAAAGTGGTGCCTCTGACTCCAGAACAAGCCCTGAAGGAATATAAA 
rACCACCTCACTGCCTATGAGAAACTGGAAATAATTAATTATCCAGAAATTTACTTTGTA 

Stc2aaI?gS^aaaagacatggagttattggtggtc 

gatcca^tggggcctatattcatgtacctcgagaccatctagcttatcgatatgaggtg 
Sgaaaa^a??ggcaaggggagtittgggcagg 

cgaSgtacgtggccctaaaaatggtgcgcaatgagaagcgctttcatcgtcaagca^ 

gaggagSccLattttggagcatcttaagaaacaggataaa^ 
a^caStcctggaaagtttcacattccggaaccatgtttgcatg 
agcSagacctttatgagctgattaaaaaaaataagtttcagggtt 
gtacgc^gtttgcccagtccatcttgcaatctttggatgccctccacaaaaataagatt 

aSSctccgatctgaagccagaaaacattctcct^ 
aIgg^Sttgactttgggtccagctgtttcgagtaccagaagctctacacatatatc^ 

tc??gg?tctacagagctccagaaatcatcttaggaagccgctacagcacaccaat^ 

arrrCAGCTCAAGCATTAAGACACCCTTGGATTAGCAAGTCTGTCCCCAGACCTCTCACC 

accaSgS^ 

GCTTCTAAGCTGCCTCCAGTTGTTGGAATAGCCAATAAG 

gaaIcca^tggtagtatacccctatgcagtgtattgccaaaactgattagcta^^ 

GAG^TATOCCCAGAGATGCATATGTGTATATTTTTATGATCTTACAAACCTGCAAATGGA 

aSt^aagcccattggtggatgtttttgttagagtagactttttttaaacaagacaa 

A^CATTTTTATATGATTATAAAAGAATTCTTCAAGGGCTAATTACCTAACCAGCTTGTAT 
TGGCCATCTGGAATATGCATTAAATGACTTTTTATAGGTCA 
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FIGURE 2FFF 
SEQ ID NO: 70_AA58924 1_M DYRK3_M 

CCACGCGTCCGGAGTTGCTAGGAATGCCACCGCAGAAACTTCTGGAGCAATCCAAGCGTG 
CCAAGTACTTTATTAACTCCAAAGGCTTGCCTCGATACTGCTCCGTATCTACCCAGACGG 
ACGGGAGGGTGGTGCTTCTCGGGGGTCGCTCACGCAGGGGTAAAAAGCGAGGCCCGCCAG 
GCAGCAAAGACTGGGCAACCGCACTGAAGGGCTGTGGTGACTACTTGTTCATAGAGTTTC 
TGAAACGATGCCTCCAGTGGGACCCCTCTGCCCGCCTCACCCCGGCTCAAGCATTAAGAC 
ATCCTTGGATTAGCAAGTCTACACCCAAACCTCTCACCATGGACAAGGTGCCAGGGAAGC 
GGGTAGTTAACCCTACAAATGCTTTCCAGGGACTGGGTTCCAAGCTGCCTCCAGTCGTTG 
GGATAGCCAGTAAGCTTAAAGCTAACCTAATGTCCGAAACCAGTGGTAGTATACCTCTGT 
GCAGTGTATTGCCAAAGCTGATTAGCTAGTGGACCACTCAGAGACTGATACATATCATAT 
GTATTTTTAATTACCTTGCAAACATGCAAATGGAAAACGGAATAATTGAAGCCCATTCAC 
TGATGGATATGTTTTTGTTAGACTTTTTTTTAACAAGGCAGAACATTTTTATATGACTAT 
AAAAGAACGCTTCAAGGGCTAATGTCAAACCAGCTTGTATTGGCCATCTGGAGTATACAT 
TAAATGACTTTTTCATAGGTC 

SEQ ID NO: 71_5R72_1 6_2_H 

GTCGAGGCGCAGCGCTGCCATGGCTGGGGGCCGTGGGGCCCCCGGGCGCGGCCGGGACGA 
GCCTCCGGAGAGCTACCCGCAACGACAGGACCACGAGCTACAGGCCCTGGAGGCCATCTA 
CGGCGCGGACTTCCAAGACCTGCGGCCGGACGCTTGCGGACCGGTCAAAGAGCCCCCTGA 
AATCAATTTAGTTTTGTACCCTCAAGGCCTAACTGGTGAAGAAGTATATGTAAAAGTGGA 
TTTGAGGGTTAAATGCCCACCTACCTATCCAGATGTAGTTCCTGAAATAGAGTTAAAAAA 
TGCCAAAGGTCTATCAAATGAAAGTGTCAATTTGTTAAAATCTCGCCTAGAAGAACTGGC 
CAAGAAACACTGTGGGGAGGTGATGATCTTTGAACTGGCTTACCACGTGCAGTCATTTCT 
CAGCGAGCATAACAAGCCCCCTCCCAAGTCTTTTCATGAAGAAATGCTGGAAAGGCGGGC 
TCAGGAGGAGCAGCAGAGGCTGTTGGAGGCCAAGCGGAAAGAAGAGCAGGAGCAACGTGA 
AATCCTGCATGAGATTCAGAGAAGGAAAGAAGAGATAAAAGAAGAGAAAAAAAGGAAAGA 
AATGGCTAAGCAGGAACGTTTGGAAATTGCTAGTTTGTCAAACCAAGATCATACCTCTAA 
GAAGGACCCAGGAGGACACAGAACGGCTGCCATTCTACATGGAGGCTCTCCTGACTTTGT 
AGGAAATGGTAAACATCGGGCAAACTCCTCAGGAAGGTCTAGGCGAGAACGTCAGTATTC 
TGTATGTAATAGTGAAGATTCTCCTGGCTCTTGTGAAATTCTGTATTTCAATATGGGGAG 
TCCTGATCAGCTCATGGTGCACAAAGGGAAATGTATTGGCAGTGATGAACAACTTGGAAA 
ATTAGTCTACAATGCTTTGGAAACAGCCACTGGTGGCTTTGTCTTGTTGTATGAGTGGGT 
CCTTCAGTGGCAGAAAAAAATGGGTCCATTCCTTACCAGTCAAGAAAAAGAGAAGATTGA 
TAAGTGCAAAAAGCAGATTCAAGGAACAGAAACAGAATTCAACTCACTGGTAAAATTGAG 
CCATCCAAATGTAGTACGCTACCTTGCAATGAATCTCAAAGAGCAAGACGACTCCATCGT 
GGTGGACATTTTAGTGGAGCACATTAGTGGGGTCTCTCTTGCTGCACACCTGAGCCACTC 
AGGCCCCATCCCTGTGCATCAGCTTCGCAGGTACACAGCTCAGCTCCTGTCAGGCCTTGA 
TTATCTGCACAGCAATTCTGTGGTGCATAAGGTCCTGAGTGCATCTAATGTCTTGGTGGA 
TGCAGAAGGCACCGTCAAGATTACGGACTATAGCATTTCTAAGCGCCTCGCAGACATTTG 
CAAGGAGGATGTGTTTGAGCAAACCCGAGTTCGTTTTAGTGACAATGCTCTGCCTTATAA 
AACGGGGAAGAAAGGAGATGTTTGGCGTCTTGGCCTTCTGCTGCTGTCCCTCAGCCAAGG 
ACAGGAATGTGGAGAGTACCCTGTGACCATCCCTAGTGACTTACCAGCTGACTTTCAAGA 
TTTTCTAAAGAAATGTGTGTGCTTGGATGACAAGGAAAGATGGAGTCCCCAGCAGTTGTT 
GAAACACAGCTTTATAAATCCCCAGCCAAAAATGCCTCTAGTGGAACAAAGTCCTGAAGA 
TTCTGGAGGACAAGATTATGTTGAGACTGTTATTCCTAGCAACCGGCTACCCAGTGCTGC 
CTTCTTTAGTGAGACACAGAGACAGTTTTCCCGATACTTCATTGAGTTTGAAGAATTACA 
ACTTCTTGGTAAAGGAGCTTTTGGAGCTGTCATCAAGGTGCAGAACAAGTTGGACGGCTG 
CTGCTACGCAGTGAAGCGCATCCCCATCAACCCGGCCAGCCGGCAGTTCCGCAGGATCAA 
GGGCGAAGTGACACTGCTGTCACGGCTGCACCATGAGAACATTGTGCGCTACTACAACGC 
CTGGATCGAGCGGCACGAGCGGCCGGCGGGACCGGGGACGCCGCCCCCGGACTCCGGGCC 
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FIGURE 2GGG 

CCTGGCCAAGGATGACCGAGCTGCACGCGGGCAGCCGGCGAGCGACACAGACGGCCTGGA 

CAGCGTAGAGGCCGCCGCGCCGCCACCCATCCTCAGCAGCTCGGTGGAGTGGAGCACTTC 

GGGCGAGCGCTCGGCCAGTGCCCGTTTCCCCGCCACCGGCCCGGGCTCCAGCGATGACGA 

GGACGACGACGAGGACGAGCACGGTGGCGTCTTCTCCCAGTCCTTCCTGCCTGCTTCAGA 

TTCTGAAAGTGATATTATCTTTGACAATGAAGATGAGAACAGTAAAAGTCAGAATCAGGA 

TGAAGATTGCAATGAAAAGAATGGCTGCCATGAAAGTGAGCCATCAGTGACGACTGAGGC 

TGTGCACTACCTATACATCCAGATGGAGTACTGTGAGAAGAGCACTTTACGAGACACCAT 

TGACCAGGGACTGTATCGAGACACCGTCAGACTCTGGAGGCTTTTTCGAGAGATTCTGGA 

TGGATTAGCTTATATCCATGAGAAAGGAATGATTCACCGGGATTTGAAGCCTGTCAACAT 

TTTTTTGGATTCTGATGACCATGTGAAAATAGGTGATTTTGGTTTGGCGACAGACCATCT 

AGCCTTTTCTGCTGACAGCAAACAAGACGATCAGACAGGAGACTTGATTAAGTCAGACCC 

TTCAGGTCACTTAACTGGGATGGTTGGCACTGCTCTCTATGTAAGCCCAGAGGTCCAAGG 

AAGCACCAAATCTGCATACAACCAGAAAGTGGATCTCTTCAGCCTGGGAATTATCTTCTT 

TGAGATGTCCTATCACCCCATGGTCACGGCTTCAGAAAGGATCTTTGTTCTCAACCAACT 

CAGAGATCCCACTTCGCCTAAGTTTCCAGAAGACTTTGACGATGGAGAGCATGCAAAGCA 

GAAATCAGTCATCTCCTGGCTGTTGAACCACGATCCAGCAAAACGGCCCACAGCCACAGA 

GCTGCTCAAGAGTGAGCTGCTGCCCCCACCCCAGATGGAGGAGTCAGAGCTGCATGAAGT 

GCTGCACCACACGCTGACCAACGTGGATGGGAAGGCCTACCGCACCATGATGGCCCAGAT 

CTTCTCGCAGCGCATCTCCCCTGCCATCGATTACACCTATGACAGCGACATACTGAAGGG 

CAACTTCTCAATCCGTACAGCCAAGATGCAGCAGCATGTGTGTGAAACCATCATCCGCAT 

CTTTAAAAGACATGGAGCTGTTCAGTTGTGTACTCCACTACTGCTTCCCCGAAACAGACA 

AATATATGAGCACAACGAAGCTGCCCTATTCATGGACCACAGCGGGATGCTGGTGATGCT 

TCCTTTTGACCTGCGGATCCCTTTTGCAAGATATGTGGCAAGAAATAATATATTGAATTT 

AAAACGATACTGCATAGAACGTGTGTTCAGGCCGCGCAAGTTAGATCGATTTCATCCCAA 

AGAACTTCTGGAGTGTGCATTTGATATTGTCACTTCTACCACCAACAGCTTTCTGCCCAC 

TGCTGAAATTATCTACACTATCTATGAAATCATCCAAGAGTTTCCAGCACTTCAGGAAAG 

AAATTACAGTATTTATTTGAACCATACCATGTTATTGAAAGCAATACTCTTACACTGTGG 

GATCCCAGAAGATAAACTCAGTCAAGTCTACATTATTCTGTATGATGCTGTGACAGAGAA 

GCTGACGAGGAGAGAAGTGGAAGCTAAATTTTGTAATCTGTCTTTGTCTTCTAATAGTCT 

GTGTCGACTCTACAAGTTTATTGAACAGAAGGGAGATTTGCAAGATCTTATGCCAACAAT 

AAATTCATTAATAAAACAGAAAACAGGTATTGCACAGTTGGTGAAGTATGGCTTAAAAGA 

CCTAGAGGAGGTTGTTGGACTGTTGAAGAAACTCGGCATCAAGTTACAGGTCTTGATCAA 

TTTGGGCTTGGTTTACAAGGTGCAGCAGCACAATGGAATCATCTTCCAGTTTGTGGCTTT 

CATCAAACGAAGGCAAAGGGCTGTACCTGAAATCCTCGCAGCTGGAGGCAGATATGACCT 

GCTGATTCCCCAGTTTAGAGGGCCACAAGCTCTGGGGCCAGTTCCCACTGCCATTGGGGT 

CAGCATAGCTATAGACAAGATATCTGCTGCTGTCCTCAACATGGAGGAATCTGTTACAAT 

AAGCTCTTGTGACCTCCTGGTTGTAAGTGTTGGTCAGATGTCTATGTCCAGGGCCATCAA 

CCTAACCCAGAAACTCTGGACAGCAGGCATCACAGCAGAAATCATGTACGACTGGTCACA 

GTCCCAAGAGGAATTACAAGAGTACTGCAGACATCATGAAATCACCTATGTGGCCCTTGT 

CTCGGATAAAGAAGGAAGCCATGTCAAGGTTAAGTCTTTCGAGAAGGAAAGGCAGACAGA 

GAAGCGTGTGCTGGAGACTGAACTTGTGGACCATGTACTGCAGAAACTGAGGACTAAAGT 

CACTGATGAAAGGAATGGCAGAGAAGCTTCCGATAATCTTGCAGTGCAAAATCTGAAGGG 

GTCATTTTCTAATGCTTCAGGTTTGTTTGAAATCCATGGAGCAACAGTGGTTCCCATTGT 

GAGTGTGCTAGCCCCGGAGAAGCTGTCAGCCAGCACTAGGAGGCGCTATGAAACTCAGGT 

ACAAACTCGACTTCAGACCTCCCTTGCCAACTTACATCAGAAAAGCAGTGAAATTGAAAT 

TCTGGCTGTGGATCTACCCAAAGAAACAATATTACAGTTTTTATCATTAGAGTGGGATGC 

TGATGAACAGGCATTTAACACAACTGTGAAGCAGCTGCTGTCACGCCTGCCAAAGCAAAG 

ATACCTCAAATTAGTCTGTGATGAAATTTATAACATCAAAGTAGAAAAAAAGGTGTCTGT 

GCTATTTCTGTACAGCTATAGAGATGACTACTACAGAATCTTATTTTAACCCTAAAGAAC 

TGTCGTTAACCTCATTCAAACAGACAGAGGCTTATACTGGAATAATGGAATGTTGTACAT 
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FIGURE 2HHH 

TCATCATAATTTAAAATTAAATTCTAAGAAGAGGCTGGGTGCAGTGGCTCACACCTTTAA 
TCCCAGCACTTTGGGAAGCCAAGGCAGGAAGACTGCTTGAAACCAGGAGTTTGAGACCAG 

CCT 

SEQ ID NO: 73_R43524_H, HRI_H 

ATGCTGGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGACGGGGCTGGGGCT 

GTGGCTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACCCCGAATATGACGAA 

TCTGATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAACAGCCAACCTTCCCT 

TTTGCAGTTGCAAACCAACTCTTGCTGGTTTCTTTGCTGGAGCACTTGAGCCACGTGCAT 

GAACCAAACCCACTTCGTTCAAGACAGGTGTTTAAGCTACTTTGCCAGACGTTTATCAAA 

ATGGGGCTGTTGTCTTCTTTCACTTGTAGTGACGAGTTTAGCTCATTGAGACTACATCAC 

AACAGAGCTATTACACACTTAATGAGGTCTGCTAAAGAGAGAGTTCGTCAGGATCCTTGT 

GAGGATATTTCTCGTATCCAGAAAATCAGATCAAGGGAAGTAGCCTTGGAAGCACAAACT 

TCACGTTACTTAAATGAATTTGAAGAACTTGTCATCTTAGGAAAAGGTGGATACGGAAGA 

GTATACAAGGTCAGGAATAAATTAGATGGTCAGTATTATGCAATAAAAAAAATCCTGATT 

AAGGGTGCAACTAAAACAGTTTGCATGAAGGTCCTACGGGAAGTGAAGGTGCTGGCAGGT 

CTTCAGCACCCCAATATTGTTGGCTATCACACCGCGTGGATAGAACATGTTCATGTGATT 

CAGCCACGAGCAGACAGAGCTGCCATTGAGTTGCCATCTCTGGAAGTGCTCTCCGACCAG 

GAAGAGGACAGAGAGCAATGTGGTGTTAAAAATGATGAAAGTAGCAGCTCATCCATTATC 

TTTGCTGAGCCCACCCCAGAAAAAGAAAAACGCTTTGGAGAATCTGACACTGAAAATCAG 

AATAACAAGTCGGTGAAGTACACCACCAATTTAGTCATAAGAGAATCTGGTGAACTTGAG 

TCGACCCTGGAGCTCCAGGAAAATGGCTTGGCTGGTTTGTCTGCCAGTTCAATTGTGGAA 

CAGCAGCTGCCACTCAGGCGTAATTCCCACCTAGAGGAGAGTTTCACATCCACCGAAGAA 

TCTTCCGAAGAAAATGTCAACTTTTTGGGTCAGACAGAGGCACAGTACCACCTGATGCTG 

CACATCCAGATGCAGCTGTGTGAGCTCTCGCTGTGGGATTGGATAGTCGAGAGAAACAAG 

CGGGGCCGGGAGTATGTGGACGAGTCTGCCTGTCCTTATGTTATGGCCAATGTTGCAACA 

AAAATTTTTCAAGAATTGGTAGAAGGTGTGTTTTACATACATAACATGGGAATTGTGCAC 

CGAGATCTGAAGCCAAGAAATATTTTTCTTCATGGCCCTGATCAGCAAGTAAAAATAGGA 

GACTTTGGTCTGGCCTGCACAGACATCCTACAGAAGAACACAGACTGGACCAACAGAAAC 

GGGAAGAGAACACCAACACATACGTCCAGAGTGGGTACTTGTCTGTACGCTTCACCCGAA 

CAGTTGGAAGGATCTGAGTATGATGCCAAGTCAGATATGTACAGCTTGGGTGTGGTCCTG 

CTAGAGCTCTTTCAGCCGTTTGGAACAGAAATGGAGCGAGCAGAAGTTCTAACAGGTTTA 

AGAACTGGTCAGTTGCCGGAATCCCTCCGTAAAAGGTGTCCAGTGCAAGCCAAGTATATC 

CAGCACTTAACGAGAAGGAACTCATCGCAGAGACCATCTGCCATTCAGCTGCTGCAGAGT 

GAACTTTTCCAAAATTCTGGAAATGTTAACCTCACCCTACAGATGAAGATAATAGAGCAA 

GAAAAAGAAATTGCAGAACTAAAGAAGCAGCTAAACCTCCTTTCTCAAGACAAAGGGGTG 

AGGGATGACGGAAAGGATGGGGGCGTGGGATGA 
SEQ ID NO: 74 17000057519457_H 

CACAAGAGCCCTTCCTGCAGGGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGT 
AACCACTTACAGGCCGGAAGTGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCA 
GGATTGCCGAAGAAGTCCCAGGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAG 
AGACAGCTGATCGGTTGGAGCTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTA 
CGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGC 
GGAGCAGCCGCTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGT 
TCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACC 
GGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGC 
TCCTCCGCTGTCGCCGCGCTGGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTT 
CCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGT 
CCACTATGGAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGC 
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FIGURE 2III 

AGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACA 

TGCTCCTGAAACCCCCCCTGGAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTT 

TCATTTCAGCACTTCCAGAGGATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCC 

TCAGTACCCATCCCAACACTGAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCT 

CCTCCAAAAAGGCCAGGCCAGTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAA 

AGAGGTCCATGGTTGGGTAGAAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTT 

TCAAAGTAAATTTGAAGAAATGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAG 

ATATTTTTAAGTGGTATGTGATCGTGTCATTATCATCTGCACTTCACTCAAGAGCTTACT 

ATGTGTCTAAGTCATGTTCTAGGCAGAATTGGGTATTTAAAGTAAATTGAGGACAGGCTT 

CTCCCAGATTGTGACATGTATATCTCAGATACATGGGTGTGGCATTGAACCACATAATGA 

GAACATTATTCTCTTTTTAGTCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTC 

GCTGAGCTTACTGGCCCTCTAACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTAT 

ATTTATTTTGAAACCAGTTTAATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATA 

CAGCATGGAAAATATCAGTGTATTGTTTTATGAAACTTTCACGTGTATATATAGACCAAG 

GATATGTGCTGAGTTTTGATGTCAAATATATTTCTCTTTCAGGGTCATGATCAAAAAATG 

AAAAGTCTGCTTAACTCCAATTTCTCTTTTAAAAAAGCAGACTTACAGCTTTCAGGCAAC 

TGAAATTCATGTTAACATGTTTTTATTTTTATTGCTTTGTATTTTTGTGGTTACCTTCTA 

AGACAAGTGATTGATCTAAAGT.TCCTTTTAAGTTTATACCGCTAAACAAACTGAGTTGAT 

TTCTATCACAGGCAGTAAGTAGGTAGAGCAAAAATGGTGAAGTGACTTGTGAAGACTGAA 

GTTTGATGAAGTCTGGTTTAAGGCACAGGTAAACTGAGTGTGGATGCAAAAGTACCAGGA 

GCTAGCTTTTAACCTTGCCCAGCCTCAGTTTCTTTTCTTAGAAGAAGCTATGTTTGGGTG 

GGAAGGGAAGAGAGGGATAAGAAAATACCTTTCTTCCTTGTAAACTCCAATCAACAAACA 

TATTTTGAGTGCCTTTTGTGTTCCTTGGCACCCTGTTGGGTATTGGGTACTTGGCACCCT 

GTTGGGTATTGGGTACAATGGTGAGCCAGACAGACACAGCGCCTGTCCTTTTGTAAGAAT 

ATTTATTTTTATAAAAAAGTATAAAGTATACAGTGGGATGTTTTGATATACATTATGAAA 

TGATTGCTACAGCTGAGCTAATTAACACCCATCACCTCACATAGTTACTGTCTTGTTTCT 

TAATATGGACATTTGCAGCTATGAATTTCCCTCTGCACACTGTTGTCATCACACACTCTC 

AGTTTTGGTATTTTGTGTTTTTGTTTTCATTCATCTCAAAGTATTTTCTAATTTCCCTTG 

TGATTTCTTCTTTGACCCCTTGATTGTTTAGAAATCTGTTAATTTCCACACATTTGTAAA 

TGTTCCAATTTTTCTTTTGTTATTGCCAGCTTCATTCCATTGTGTTCAGAGATGATACAG 

TCAGTGCCTGTTCTTATGAAGCAAACATTCTATAATAGTAGGACCAGTACCCTGTCTGTT 

TCATTCACCACAGTCAGCATGCCCCAAGTGCCCAGCATGGGGCGGATGGCCAGGAATGAG 

TGAAAACTTCCCTTCCTGGGTAGTTGTGACTAGTAGAGAGGAAAAATAATATAATTGCCT 

GCTTACTGCATGCCAGGCATTGGGCTGGGAATTTTTATATTGGATCTAAAATAACTCTTA 

AGTTAGGCATTATCCCCATTTTATAGATGGAGAAACTGGCCCCAAAAGGTGGGAACTTGT 

CCAAGACGTCACAGGTAGCAAGAGGTACTTTTACCTGGCTCCAAATCTGTGTTCTTTCCA 

CTGACAAATGAGATATGGGATATGGTGCATCTTTACAGTACTATAATAAGTATTGGCGTA 

TAACATTATTTTCAAGGAACTCCAAGGGCCACAGGAGCTGACAGGTTTTTCAATTAATAT 

TCCCAACATGAATGAGATGCCTCATTCCTCAGTTTCCTCACGTGTACTATAAGGCTAGTA 

CCTGCTTTGTTGGGGTATGGTTGGCTCGTGTGCATTAAGTCAACAAATCCCTAGT 

SEQ ID NO: 75_AA013524_M 

CTGGTGCAGCAGGGCGCCGAGGCGCGCGTTTTCCGTGGCCGCTTCCAGGGCCGCGCGGCC 
GTGGTGAAGCACCGCTTCCCGAAGAGTTACCGGCACCCGGAGCTGGAGGCGCGGCTCGGC 
CGTCGGCGGACGGTGCAGGAGGCGCGCGCGCTGCTCCGCTGCCGCCGTGCGGGGATAGCT 
GCCCCAGTCGTCTTCTTTGTGGACTATGCGTCTAACTGCTTATATATGGAAGAAATCGAA 
GACTCGGTGACTGTTCGGGATTATATCCAATCCACTATGGAGACTGAAAAGGACCCCCAG 
TGCCTCTTGGACCTGGCCAGGAGGATGGGGCAGGTTCTGGCCGGAATGCACGACCAAGAC 
CTCATTCACGGGGACCTCACCACCTCCAACATGCTCCTGAGGCGGCCCCTGGCGCAGCTG 
CACATCGTGCTCATCGACTTTGGGCTGAGCTTTGTCTCAGGACTGCCGGAAGATAAAGGC 
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FIGURE 2JJJ 

CTCGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGCACGCACCCCCACACCGAGACCGCG 
TTTGAAGCCTTTCTGAAGAGTTACGGGGCCTCGTCCAAGAAGTCCAGTCCAGTGCTGAAG 
AAGTTAGATGAGGTGCGCCTGAGAGGGCGAAAGCGGTCCATGGTCGGGTAGTGGAGCTGT 
GGTGAACTGGCTCACGGTGAAGGATGATGTAGACGAGGCTGGACCCCTCAGCAAAGCATG 
GGTTGTTAAGTGGTCTGTGATCGTGCTGGGCCACCACCATCCATGGCTCACTGTTCTCAG 
GGGCTTCATGTACATGAGGTTTATTCTGGGCAGAACTGGGTAGGTAGCCCAGGCTAGCCT 
TGAATTTATGGCAACATCCTACCTCAGCTTGCTTGGAAGAGGTTATAAGCCACCATACCT 

GACTTTGCACTGATTCTGTCAGAAAC 

SEO ID NO: 76 17000139801197_H, IRAKM_H 

ATGGCGGGGAACTGTGGGGCCCGCGGCGCGCTGTCGGCGCACACGCTGCTGTTCGACCTG 
CCGCCCGCGCTGCTCGGAGAGCTCTGCGCTGTTCTGGACAGCTGCGACGGCGCGCTGGGC 
TGGCGCGGCCTGGCAGAGAGACTTTCAAGCAGCTGGCTGGATGTTCGTCATATTGAAAAG 

S?g?agaccaIggtaaaagtggaacaagag^ 

Sgaccatcggtgaccttttacaggtcctccaggagatgggacatcgtcgagctattcat 
™ttacaaactatggagcagtgttgagtccttcagagaagagttatcaggaaggtgga 
?ttccaaatatattattcaaggaaacagccaatgtcaccgtggataatgttcttattcct 

gSca^tgaaaaaggagtactgcttaaatcttccatcagctttcaaaata^ 
ggaactagaaatttccacaaagacttcctaattggagaaggagagatttttgaggtatac 

ag^ggag^ttcaaaacctaacatatgctgtcaaat^^ 
caS?g?aagaagcattggaagaggtttttatctgagcttgaagttttactactgtttcat 

ca^caa^a^^ 

ta?cc™catgagaaatggaacactttttgacagattgcagtgtgtaggtgacacggcc 

ccactcccttggcacattcgaatcggtatattaataggaatatccaaagccattca^ 

ctcScaacgttcaaccatgctcggtcatctgtggcagtatatcaagtgcaaacatcctt 

ttgga^tcagtttcaacccaaactaactgattttgccatggcacacttccggtcccac 

Sagaacatcagagttgtaccataaatatgaccagcagcagcagtaaacatctgtg^ 

atScSgaagagtacatcagacaggggaaactttccattaaaacagatgtctacagct^ 

ggSSg^taatggaagttctaacaggatgtagagtagtgttagatgatccaaaacat 

a?c^Ig?™g^atctccttagagaattgatggagaagagaggcctggattcatgtct 

SSgSggccggtgtgctgcaacgcgggcaaagttaagaccatcaatg 
II?ac????gaaagtactcaagccagcttgtattttgctgaagatcctcccacatca^ 
a^gtccttcaggtgtccttctcctctattcctggagaatgtaccaagtattccagtggaa 
g1?ga?gaaagccagaataacaatttactaccttctgatgaaggcctgaggatagacaga 

atgactca^a^aactccttttgaatgcagccagtctgaggttatg^ 
aaaaagccagagagcaagagaaatgaggaagcttgcaacatgcccagttcttcttgtgaa 

^??ggttcccaaagtatatagttccatcccaggacttaaggccctataaggta^t 

a?agatccttcttcagaagctccagggcattcttgcaggagcaggccagtggagagcagc 

tgttcctccaaattttcctgggatgaatatgaacagtacaaaaaagaataa 



qFO ID NO: 77 AA840598 M IRAKM M 

atgtggaagagatttttatcagaactggaagttctactcctgttccgtcacccccaca 
cSgagct^ 

AGCAACGGGACGCTTTTCGACAGATTACAGTGCACAAATGGCACAACCCCGCTTTCCT 

cac^Sgaatcagcgtattgataggaatagccaaagccatccaatacttc 

CAGCCGTGCGCCGTCATCTGTGGCAACGTTTCCAGTGCAAACATACTCTTGGATGACCAG 

SccSScaaactaacggattttgctgcagcgcacttccgacccaatctagag^ 
agSctac^^ 
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FIGURE 2KKK 

ATGGAGGTTCTAACGGGCTGCAAAGTGG^ 

GACCTCCTCATGGAACTGATGGAGAAAA^^ 
AGGAAGATACCACCCTGTCCTCGGAACTTCTCTG^ 

TGTGTGGCAACGAAGGCCAAGTTAAGAC^ 
AGCACCCAGCCTAGCTTGTATTTTGCAGAAGA^ 

TGTCCTTCTCCACTGTTCTTGGA^ 
CAGAATAACCATTCAGTACCTCCCAAGGAAGTT^ 



CCCCTTTGAATGCAGCCAGTCTG^ 
CAGGGGAAGTGAAGCGGATT^ 

TGGGAAGTACCAGGCCATTCTTATGGGAGCA^GCCAA^ 



AO 

CCAGAGCTTG' ■GOOCCATCCCJ^rw --^ ^ , ; , rCTGGG 



AGCTCTGRGCTCCGTGAGTACAG^GGGACCMC TCAAGRGATGC TTTAT 



TCAAGTC^CCGCCTCTCAGTCA^ 

rCATCATCCGAGTGAGATCTTGG: 
3AACCACCTTCTCCCCAAGCCAG( 
3CAGCCAATCATGATACAGTGTCi 

TGCTAGGGATACTGACAGTCTATTTGCTTCCCAI^ TGGAAGGAA TTGTGT 

TTATATTAAAGAATTCCAGCACT 

SEO ID NO: 7B_AA086547 H CCCCGGCTGGGG CTCCAGCTCCAGTTC 
ATGGCGAGTGCGGTGAGGGGGTCGAGGGCGTGGCCCC^ TCAGGCCA GAGAAGCTC 
GCGGCGCTGCTGCTCGGGACGCTGAGTCCACAGGTTCATACT 

CTGCTGGTGTGGAGCTTGGATGGAAGTCTCC^GCA^ 
ftJ^GCACTGGTCCACACAGGAGTG^CCCT 



GATGGCCCCAC 
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FIGURE 2LLL 

GGGGCCAGCCGGAGGAGCCAG^GAGGCTTCA^ 

GACGACCCTGAAGCTGAGCAACTCACCGT^ 

GTGCTGGGCCGCGGGGCAGGCGGGAC^ 

GTGGCTGTCAAGCGGCTCCTCCGCGAGTGCTTTbO CCGAGCGGGGA CCC 

SgcaLagtctgacaggcac^ 
Sgttccactacattgccctggagctctgc^ 

atcaccgggcctgacagccagggcctgggc^a^gact 
aIgaagctgcctgctggccgct^ 

ggaSctttatcgccaggcaaa™ 
gccaagcaactccagttcttccaggacgtcagtgac^ 

gggaggtga 

SEQ ID NO: 79_HGP_66444 66 TGTGTTGGT GCTAGAGGCAGCTGC 

Sagggttcgaattgcaacggcagct^ 
agggtctcgctgggggccgctcgggacca^tt^ 

ttcacctccgacctttccttccaggcggt^ 
ggaagggatcagtaatttcaagacaccaag^ 
Sgttcaactccaactataaatatcc^ 
tIctggggtaaatgtgtacct^ 



WO 00/73469 



PCT/USOO/14842 



FIGURE 2MMM 

TAATGAAGACCCTAAAGATCGTCCTTCTGCTGCACACATTGTTGAAGCTCTGGAAACAGA 
TGTCTAGTGATCATCTCAGCTGAAGTGTGGCTTGCGTAAATAACTGTTTATTCCAAAATA 

TTTCTTGTTAACATATGGATAACTATTTCTAATATGAAATATGCTTATATTGGCTATAAG 

ScSggaI^gtIctgggttttctgtaa^^ 

SaSgcSgctgacttaaaacactagcagtaaaacgctgta^ 

?gSgaccattacttttattaatgatctttcttaaatattctatattttaatggat 

ctgacattagcactttgtagagtacaaaataaagtctacatttgtttaaaacactg 

S??Stgatgtgtttatcaaatgataactggaagctgaggagaatatgc 

gSg?Scttggatacttcagactctggttacagattgtcttgatctcttggatct^ 

agaSSggtttttgct^ 

^tttgtaccttaagcatttcccagctgtgtaaaaacaataaa^^ 
taaagaataaaggacactttgggtaccagaaggtgtctcagcattattttatacttc 

Sctccaa'gag^ 

tgSca^atcgaactgtgtatcagaagagactaactgatgaagctaagatt^ 
JStS^ccaaacattataggatatcgtgcttttactgaagccagtgatgg^ 

ct^cttcctatggagtatggaggtgaaaagtctctgaa^^ 
2aa^SSgtggaagtccttttccagcagctgtaattctcagagttgct^ 

ra^GGGCTAAAGTACCTGCACCAAGAAAAGAAGCTGCTTCATGGAGACATAAAGTCTTC 
A^^G^A^GGTGATTTTGAAACAATTAAAATCTGTGATGTAGGAGTCTC^ 

^StcgItcaaaatatgactgtgactgatcctgaggcctgttatattggtactgagcc 

SggaIIScaa^gcgttggaagaaaatggcatca™ 

?gc?™gcc^?actctgtgggaaatgatgactttatgtattccacac 

agatgatgatgttgatgaagatgcaacctttgatgagagtgacttcgatgatgaagca^ 

™gSgctctggggacaaggccatccatcaacatg 
?Sggc2aSgaactcttctgtgtgtgcactaatgaggatcctaaag 

?g^a2aca?cg™Sgctttggaactagatggccaatg 

™c?Statgggaactgttaactagatatatgtagttaatataacttatgg 

attctagaagtagctttaacactagtgacccctgtctaagatgacttaagaatcaaggga 

rCATTGCTTTGTTACAGATCTTTTTAGATATTCTTGCTTCTTTAGTGGGTTACTAAAAAT 

TTGCCTCTCTACACTTAGCTTCTAATTTATTACTTTGTTTCTACTGATTGTGTCTTAAGT 
CTTTTAAAATAAATGTAAGAATAAACAATAAAAGACAGTTTTAGTACCAGG 

oun m NO- 81 5R57 10 2 M TESK2 M 

GCTGCTGGACAGTGACTTGTATTTACCGTGGACTGTGAGAGTGAAACTGGCCTATGG^ 
AGCAGTG^GCCTCAGCTACCTTCACTTCAAAGGCATTTTCCATCGGGACCTCAC^ 

GGTGTGAAGGCTTTGCTTTC 

tlr^rGGTGGAGGAAGTTTTGGGAGTGTTTATCGAGCCAAATGGATATCACAGGACAAG 

^GGTGGCTGTA^GAAGCTCCTCAAAATAGAGAAAGAGGCAGAAAT^ 

AGTCACAGAA^C^TCATCCAGTTTTATGGAGTAATTCTTGAACCTCCCTAC 

GTCACAGAA^ATGCTTCTCTGGGATCACTCTATGATTACATTAACAGTAACAGAAGTGAG 

rAGATGGATATGGATCACATTATGACCTGGGCCACTGATGTAGCCAAAGGAATG 

??acI?a?gg!ggc?S 



WO 00/73469 



PCTAJS00/14842 



FIGURE 2NNN 

ATAGCTGCTGATGGAGTATTGAAGATCTGTGACTTTGGTGCCTCTCGGTTCCATAACCAT 

ACAACACACATGTCCTTGGTTGGAACTTTCCCATGGATGGCTCCAGAAGTTATCCAGAGT 

CTCCCTGTGTCAGAAACTTGTGACACATATTCCTATGGTGTGGTTCTCTGGGAGATGCTA 

ACAAGGGAGGTCCCCTTTAAAGGTTTGGAAGGATTACAAGTAGCTTGGCTTGTAGTGGAA 

AAAAACGAGAGATTAACCATTCCAAGCAGTTGCCCCAGAAGTTTTGCTGAACTGTTACAT 

CAGTGTTGGGAAGCTGATGCCAAGAAACGGCCATCATTCAAGCAAATCATTTCAATCCTG 

PAGTCCATGTCAAATGACACGAGCCTTCCTGACAAGTGTAACTCATTCCTACACAACAAG 

GCGGAGTGGAGGTGCGAAATTGAGGCAACTCTTGAGAGGCTAAAGAAACTAGAGCGTGAT 

CTCAGCTTTAAGGAGCAGGAGCTTAAAGAACGAGAAAGACGTTTAAAGATGTGGGAGCAA 

AAGCTGACAGAGCAGTCCAACACCCCGCTGCTGCCTTCCTTTGAGATTGGTGCATGGACG 

CAAGACGATGTGTATTGGTGGGTTCAGCAGCTCGTCAGAAAAGGTGACTCTTCAGCAGAG 

ATGAGTGTATATGCAAGCTTGTTTAAAGAAAACAACATTACAGGGAAGCGGCTGCTGCTG 

CTGGAGGAAGAAGACCTGAAAGACATGGGCATTGTCTCCAAGGGGCATATCATTCACTTC 

AAGTCAGCCATTGAGAAATTAACCCATGATTACATAAATTTGTTTCACTTCCCACCACTA 

ATTAAGGACTCAGGAGGTGAACCTGAAGAAAATGAGGAAAAAATAGTGAACCTGGAACTG 

GTTTTTGGTTTTCACTTGAAACCAGGAACTGGCCCACAGGATTGTAAGTGGAAAATGTAT 

ATGGAGATGGATGGGGATGAAATTGCAATAACCTACATAAAAGATGTGACATTCAACACT 

aIcctacctgatgcggagattttaaagatgacaaagccaccatttgtaatggagaagtgg 

ATTGTAGGAATAGCAAAAAGTCAGACTGTGGAGTGCACTGTCACATATGAGAGTGATGTT 
AGAACTCCAAAAAGCACTAAACATGTCCATTTGATTCAGTGGAGTAGAACAAAACCTCAG 
GATGAAGTGAAAGCAGTCCAACTTGCCATTCAGACATTATTCACCAATTCAGATGGCAAC 
CCTGGAAGCAGGTCCGACTCAAGTGCTGATTGCCAGTGGTTAGATACTCTGAGGATGCGG 
CAGA?TGCATCCAACACTTCTTTACAGCGTTCCCAGAGCAATCCTATTCTGGGGTCACCG 
TTCTTCTCACACTTTGATGGCCAGGATTCCTACGCTGCTGCTGTGAGACGGCCCCAGGTG 
CCCATTAAGTATCAACAGATTACACCTGTGAACCAGTCCAGAAGCTCGTCTCCTACTCAG 

Stggactgaccaaaaacttctcttccttacatctcaactctagggacagtggcttttcc 

AGTGGCAATACTGACACCTCTTCAGAGAGGGGTCGATACTCAGACAGAAGCAGGAACAAA 
TATGGACGTGGTAGTATATCACTCAATTCTTCTCCTAGAGGAAGATACAGTGGAAAGAGT 
CAGCATTCCACTCCATCAAGAGGAAGATACCCTGGAAAGTTCTACAGGGTTTCTCAGTCA 
GCACTCAATCCTCACCAGTCGCCTGACTTCAAGAGAAGCCCCAGGGACCTCCACCAACCC 
AACACCATACCAGGGATGCCTTTGCACCCTGAGACTGACTCAAGAGCCAGTGAAGAGGAC 
AGCAAAGTCAGCGAAGGGGGCTGGACAAAAGTGGAATACCGGAAAAAGCCCCACAGGCCA 
TCTCCCGCCAAAACCAATAAAGAGAGAGCCAGAGGGGACCACCGTGGATGGAGAAACTTT 

TGA 

qEO ID NO: 83 AI375137 H 

ATGGGAAATTATAAATCTAGACCAACCCAAACTTGTACTGATGAATGGAAGAAAAAAGTC 

AGTGAATCATATGTTATCACAATAGAAAGATTAGAAGATGACCTGCAGATCAAGGAAAAA 

G^TCACAGAACTAAGGAATATATTTGGCTCTGATGAAGCCTTCAGTAAAGTCAATT^ 

AATTACCGCACTGAAAATGGGCTGTCTCTACTTCATTTATGTTGCATTTGTGGAGGCAAG 

AAATCACATATTCGAACTCTTATGTTGAAAGGGCTCCGCCCATCTCGACTGACAAGAAAT 

GGATTTACAGCCTTGCATTTAGCAGTTTACAAGGATAATGCAGAATTGATCACTTCTCTG 

otSSgtggagctgatatacagcaggttggatacggtggcctcactgccctccatatt 

gSacaatagctggccacctagaggctgctgatgtgctgttgcaacatggagctaatc 

aata™aagatgc^gtttttttcactccattgcatattgcagcgtactatg(^catgaa 

caggtaactcgccttcttttgaaatttggtgctgatgtaaatgtaagtggtgaagttgga 

ga?a™ccc?ccacctagcatctgcaaaaggattcttgaatattgcaa^ 

ga^gaaggcLcaaagcagatgtgaatgctcaagataatgaagaccatgtcc^^^ 

?^?gt?ctcgatttggacaccatgatatagttaagtatctgctgcaaagtgatttggaa 

g^caIcctcatgttgttaatatctatggagatacc 
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FIGURE 2000 

GGCAAATTTGAAGTTGCCAAGGAAATCATCCAAATATCAGGAACAGAAAGTCTGACTAAG 

GAAAACATCTTCAGTGAAACAGCTTTTCATAGTGCTTGTACCTATGGCAAGAGCATTGAC 

CTAGTCAAATTTCTTCTTGATCAGAATGTCATAAACATCAACCACCAAGGAAGGGATGGG 

CACACTGGATTACACTCTGCTTGCTACCACGGTCACATTCGCCTGGTTCAGTTCTTACTG 

GATAATGGAGCTGATATGAATCTAGTGGCTTGTGATCCCAGCAGGTCTAGTGGTGAAAAA 

GATGAGCAGACATGTTTGATGTGGGCTTATGAAAAAGGGCATGATGCCATTGTCACACTC 

CTGAAGCATTATAAGAGACCACAAGATGAATTGCCCTGTAATGAATATTCTCAGCCTGGA 

GGAGATGGCTCCTATGTGTCTGTTCCATCACCCTTGGGGAAGATTAAAAGCATGACAAAA 

GAGAAGGCAGATATTCTCCTCCTAAGAGCTGGATTGCCTTCACATTTCCATCTTCAGCTC 

TCAGAAATTGAGTTCCATGAGATTATTGGCTCAGGTTCTTTTGGGAAAGTATATAAAGGA 

CGATGCAGAAATAAAATAGTGGCTATAAAACGTTATCGAGCCAATACCTACTGCTCCAAG 

TCAGATGTGGATATGTTTTGCCGAGAGGTGTCCATTCTCTGCCAGCTCAATCATCCCTGC 

GTAATTCAGTTTGTGGGTGCTTGCTTGAATGATCCCAGCCAGTTTGCCATTGTCACTCAA 

TACATATCAGGGGGTTCTCTGTTCTCCCTCCTTCATGAGCAGAAGAGGATTCTTGATTTG 

CAGTCTAAATTAATTATTGCAGTAGATGTTGCCAAAGGCATGGAGTACCTTCACAACCTG 

ACACAGCCAATTATACATCGTGACTTGAACAGTCACAATATTCTTCTCTATGAGGATGGG 

CATGCTGTGGTGGCAGATTTTGGAGAATCAAGATTTCTACAGTCTCTGGATGAAGACAAC 

ATGACAAAACAACCTGGGAACCTCCGTTGGATGGCTCCTGAGGTGTTCACGCAGTGCACT 

CGGTACACCATCAAAGCAGATGTCTTCAGCTATGCTCTGTGTCTGTGGGAAATTCTCACT 

GGCGAAATTCCATTCGCTCATCTCAAGCCAGCGGCTGCGGCAGCAGACATGGCTTACCAC 

CACATCAGACCTCCCATTGGCTATTCCATTCCCAAGCCCATATCATCTCTGCTGATACGA 

GGGTGGAACGCATGTCCTGAAGGAAGACCCGAATTTTCTGAAGTTGTCATGAAGTTAGAA 

GAGTGTCTCTGCAACATTGAGCTGATGTCTCCTGCATCAAGTAACAGCAGTGGGTCTCTC 

TCACCTTCTTCTTCTTCTGATTGCCTGGTGAACCGGGGAGGACCTGGCCGGAGTCATGTG 

GCAGCATTAAGAAGTCGTTTCGAATTGGAATATGCTCTAAATGCAAGGTCCTATGCTGCT 

TTGTCCCAAAGTGCTGGACAATATTCCTCTCAAGGTCTGTCTTTGGAGGAGATGAAAAGA 

AGTCTTCAATACACACCCATTGACAAATATGGCTATGTATCCGATCCCATGAGCTCAATG 

CATTTTCATTCTTGCCGAAATAGTAGCAGCTTTGAGGACAGCAGCTGA 

SEQ ID NO: 84_H97685_H 

ATGATTTCTTGCCTGTNATAACCTATGCACTCACAAAGATGAACTCTCTGAGAGGGATGA 
GCAAGAGCTTCAGGAAATCCGAAAGTATTTCTCCTTTCCTGTATTCTTTTTCAAAGTGCC 
GAAACTGGGCTCGGAGATAATAGACTCCTCAACCAGGAGAATGGAGAGCGAAAGATCACC 
GCTTTATCGCCAGCTAATTGACCTGGGGTATCTGAGCAGCAGTCACTGGAACTGTGGGGC 
TCCTGGCCAGGATACTAAAGCTCAGAGCATGTTGGTGGAACAGAGTGAAAAGCTGAGACA 
CTTGAGCACATTTTCTCACCAGGTGTTACAGACTCGCCTGGTGGATGCAGCCAAGGCCCT 
GAACCTGGTGCACTGCCACTGCCTTGACATCTTTATTAACCAGGCATTTGACATGCAGCG 
GGACCTGCAGATCACTCCCAAACGTCTGGAATATACTCGAAAAAAGGAGAATGAGTTGTA 
TGAATCATTGATGAATATTGCCAACCGAAAGCAGGAGGAAATGAAGGATATGATTGTTGA 
GACACTTAATACCATGAAGGAGGAACTTCTGGATGATGCTACTAACATGGAGTTTAAAGA 
CGTCATTGTCCCTGAGAATGGAGAACCAGTAGGCACCAGAGAGATCAAATGCTGCATCCG 
ACAGATCCAGGAACTCATCATCTCCCGACTTAATCAGGCAGTGGCTAATAAGCTGATCAG 
CTCAGTGGATTACCTGAGGGAAAGCTTCGTCGGAACCCTGGAACGATGTCTGCAGAGCCT 
GGAGAAGTCTCAGGATGTCTCAGTTCACATCACCAGTAATTATCTCAAACAGATCTTAAA 
TGCTGCCTATCATGTTGAAGTCACGTTTCACTCAGGGTCGTCAGTTACAAGGATGCTATG 
GGAGCAAATCAAACAGATCATCCAGCGCATCACATGGGTGAGCCCACCTGCCATCACTCT 
GGAATGGAAGAGGAAGGTGGCCCAGGAAGCCATTGAGAGCCTCAGCGCCTCCAAATTGGC 
TAAGAGCATTTGCAGCCAATTCCGGACTCGGCTCAATAGTTCCCACGAGGCTTTTGCAGC 
CTCCTTGCGGCAGCTGGAAGCTGGCCACTCAGGCCGGTTAGAGAAAACGGAAGATCTATG 
GCTGAGGGTTCGGAAAGATCATGCTCCCCGCCTGGCCCGCCTTTCTCTGGAAAGCCGTTC 
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FIGURE 2PPP 

TTTACAGGATGTCTTGCTTCATCGTAAACCTAAACTGGGACAGGAACTGGGCCGGGGCCA 
GTATGGTGTGGTATACCTGTGTGACAACTGGGGAGGACACTTCCCTTGTGCCCTCAAATC 
AGTTGTCCCTCCAGATGAGAAGCACTGGAATGATCTGGCTTTGGAATTTCACTATATGAG 
GTCTCTGCCGAAGCATGAGCGATTGGTGGATCTCCATGGTTCAGTCATTGACTACAACTA 
TGGTGGTGGCTCCAGCATTGCTGTGCTCCTCATTATGGAGCGGCTACACCGGGATCTCTA 
CACAGGGCTGAAGGCTGGGCTGACCCTGGAGACACGTTTGCAGATAGCACTAGATGTGGT 
GGAGGGAATCCGCTTCCTGCACAGCCAGGGACTTGTCCATCGTGATATCAAACTGAAAAA 
TGTGCTGCTGGATAAGCAGAACCGTGCCAAGATCACTGACTTAGGATTCTGCAAGCCAGA 
GGCCATGATGTCAGGCAGCATTGTGGGGACACCAATCCATATGGCCCCTGAACTTTTCAC 
AGGGAAGTACGATAATTCCGTGGATGTCTACGCTTTTGGAATTCTTTTCTGGTATATCTG 
CTCAGGCTCTGTCAAGCTCCCTGAGGCATTTGAGAGGTGTGCTAGCAAAGACCATCTCTG 
GAACAATGTGCGGAGGGGGGCTCGCCCAGAACGTCTTCCTGTGTTTGATGAGGAGTGCTG 
GCAGTTGATGGAAGCCTGTTGGGATGGCGACCCCTTGAAGAGGCCTCTCTTGGGCATTGT 
CCAGCCCATGCTCCAGGGCATCATGAATCGGCTCTGCAAGTCCAATTCTGAGCAGCCAAA 
CAGAGGACTAGATGATTCTACTTGAAAGCAAAGACCTTTCTCTTTCACTCTCTAGTTATT 
TCCTTCCCCCTCACCATTTGGCCATGGGGAGAATTTGACATTTATTCACTATAGGACACA 
CTCCCAAGGGAACTGGTGCTTGCTGGGAAACTTGGAACCTTCCCAGGCAGGGATGACTCC 
TGGACAGTGAAGAGTTGAATGACTGAGCATATTCAGCAGCTCACTGAAGCGCCAAGCTAT 
CCCTTTAGCAAAAAAGTGTCTCAGATGTGTAAAAGCTGAGGAATGTGGTGTTCTGGCTTC 
ACAAATGAAAAGGAGGCAGATGTT 

SEQ ID NO: 85_W2 0810_M 

TTGATGTCAACCTGAAGGCTTCTAAAGCGAGTGATGTCTACAGCTTTGGGATCCTCGTGT 
GGGCAGTGCTGGCTGGCAGAGAAGCTGAGTTGGTAGACAAGACTTCACTAATCCGGGAAA 
CAGTGTGTGACAGGCAGAGTCGTCCTCCACTGACAGAGCTGCCTCCAGGTAGCCCTGAGA 
CTCCCGGCTTGGAAAAACTGAAGGAGTTAATGATTCATTGCTGGGGTTCCCAGTCCGAAA 
ACAGGCCATCCTTCCAGGACTGCGAACCAAAAACCAATGAAGTTTACAATCTGGTAAAGG 
ACAAGGTAGATGCTGCTGTCTCCGAGGTAAAGCATTATCTGTCTCAGCACAGAAGCAGCG 
GCAGAAACTTGTCTGCCAGAGAGCCAAGCCAAAGAGGCACAGAAATGGATTGCCCGAGGG 
AAACCATGGTTTCTAAAATGCTGGACCGCCTGCATTTGGAGGAACCCTCCGGACCAGTTC 
CTGGAAAATGTCCTGAGAGGCAAGCACAGGACACATCAGTTGGGCCTGCCACACCAGCAA 
GGACATCTTCTGACCCCGTGGCTGGCACTCCTCAGATTCCACATACTTTACCCTTCAGAG 
GCACAACACCTGGGCCAGTCTTTACTGAGACTCCCGGTCCTCACCCCCAAAGGAATCAGG 
GAGATGGAAGACACGGCACTCCTTGGTATCCCTGGACCCCACCGAATCCAATGACAGGGC 
CACCGGCTCTCGTCTTCAACAACTGTTCTGAAGTGCAGATTGGGAACTACAACTCCTTGG 
TAGCACCACCAAGAACTACTGCCTCAAGTTCGGCCAAGTATGACCAAGCACAGTTCGGCA 
GGGGTAGGGGCTGGCAGCCCTTCCACAAGTAGACTTCAGAGAATCACTGCAAGAGCCTGA 
AGTGTGCCATTCAGCGTGGCAATAAAAAGCACGTTTTAAGCAACCTGGACTGGCTAAGAC 
AGTCCTTGCCACTTCCTGAAGCTCACAACATTCTGTGAGGACAGTTGGACCTACACCCAA 
ACTGACTCTTGACCCATCTCCTTAAAGTCAATAAACATAGCATGTTAAGTGTG 

SEQ ID NO: 86_AA744236_H 

ATGGGATCAGAGAACAGTGCTTTAAAGAGCTATACACTGAGAGAACCACCATTTACCTTA 
CCCTCTGGACTTGCTGTTTATCCCGCTGTACTGCAAGATGGCAAATTTGCTTCAGTTTTT 
GTGTATAAGAGAGAAAATGAAGACAAGGTTAATAAAGCTGCCAAGCATTTGAAGACACTT 
CGTCACCCTTGCTTGCTAAGATTTTTATCTTGTACTGTGGAAGCGGATGGCATTCATCTT 
GTCACTGAGCGAGTACAGCCCCTGGAAGTGGCTTTGGAAACATTGTCTTCTGCAGAGGTC 
TGTGCTGGGATCTATGACATATTGCTGGCTCTTATCTTCCTTCATGACAGAGGACACCTA 
ACACACAATAATGTCTGTTTATCATCTGTGTTTGTGAGTGAAGATGGACACTGGAAGCTA 
GGAGGAATGGAAACTGTTTGTAAAGTTTCTCAGGCCACACCAGAGTTTCTGAGGAGTATT 



PCTAJS00/14842 

WO 00/73469 

FIGURE 2QQQ 
CAGTCAATAAGAGACCCAGCATCTAT^ 

CTCCCAGAGTGTCATGGACATGCCCGGGATGCCT11 T TCAACAGACC 
TTGCTCACAATCTTA^TGA^^ 

TTGCACTCAACTTTGCTGAATCCCATTCCAAAAi^i OT5AAAAGTTTAACA 
TCTCATGACTTCTTCAGAA^ 

TTGAAGAGTGAAGAGGAGAAAACGGAATTCTTTAAAlll AGTTGGTGTTT 
TTGTCAGAGGAATTGATAGCTTCA^GGTTGGTGCCT 

GCAGAGCCAGTGGCTGTTAAGAGTTTTCTTCCTTA1C TCA CGGGTGATCCCC 

GCGCAGGGAGAAACTCCTTGCTTGCTCTCACCAGCCCTOT^ 

GTGCTTCTCCAGTTGTTTGAAGTTCATGAAGAGCATGTGC^ 

ATCGAGGCCTACGTGGAGCACTTCACTCAGGAGCAGCT^^ 

GTTTTGCTGGGCCTGCGTGATACTAGCGATTCCATTGTGGC 

GCAGTGCTGGTCTCTCTGCTTG^ 
TTCAAACGCACTGCCCCAAGTTTTACTA^ 

TTTTCTCAGCCTATTAA^TTTCCCATA^ATGG GAGTGGCCT GACTGGAGTGAA 
GACAGTGAAAACTTCCCATCAAGTTCTAAAAAGIL TAGAGA ACCTTGTGAT 
CCTGAGGAGCCTGAAAATCAAACTGTCA^CATACAGATT^ 

GATGTCAAGTCCCAGTGCACTACCTTG^ 
CCCAGCAGCTTAGATACTAAAGTAAACCCAGGA^ 
ACCTCAGGGGAGCAGAAGCCTATTCCTGCTTTGCTTT 
TGGAAATCAAGCTTACCCCAAAAGATTAGCCTTGTA^ 

ATCGAGCCGCCAAAAGTGTCATCACAAGAAA^^ 

TTAGGAGAGGAATTCACCATTCAAGTAAAAAAGAAGCCAGT^ 

TGGTTTGCTGATATGATCCCAGAAATTA^GCC^TTCTGCTC 

CTGAACTGGGAAGATAATAACTGGTGA 
CGATGCTTAATAAATTGAAGAGTACTGT^ 

CTTCMTGTGTCTTCCAAATCCTGAATATTTCGCTCCJGAfl TTAAT AAAG 

g<^cctatatttg^gt™gc^gma™c^gA ot ^ g ^ gttcotaac 



PCT/USO0/14842 
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FIGURE 2RRR 

cpo ID NO: 88 AA278842_H ^ rr __ rGCTCAGCC CCGGCTCGGGCGGCCGGAG 

^gtgScgW^ 

mmm 

-gSagc?^ 

^S^CCCAACCACTGCCCCAACAGAAACCA^^ 

gs=s=s====-— 

52 1 II) 
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figure 2sss 
acag T cgacagg T caagg = 

ATCAGAGCCACAATAAATTCTATTTCAC 

SE Q ID NO: B9_M599286 H GGTGCTGCTG GACGACACGGTGCCGCTG 
ATGGCCTTCATGGAGAAGCCGCCAGCCGGC^ 

ACAGCAGCCATCGAGGCGAGCCAGAGCC^ 

CAAGGAGGAATTTCTGTGGAAAA^^ 

TTGCTTAACAACAGCTTACAGATTGCAG^ 

ATTGGTAACATGGATCGTGAATTCATAGC^ 

AACGTGATCAC^CAAATCATATCTTGTCTAATTG^ TTT CCATGTTCTTC 

GCAAAACCAAAAGACCCATTTCTAAAGAAGT^ 
GAACTCCAGCAAATAAAAACATATGG^ACGGCAAATA 

GACAAGGGATTCCCTTATGGGCATCTTC^^^ 

TGCCGGCTGCTGGACCTTGAGAMTC^ 

TTTTCACAATTCAGGAAAATCAAT^ 
TTACTGTATGAAATGACTTATGGACGACCGCCAGALl^ 

CCTGCCCCGTCCATGGCTGTGCTGGCCGTGTTGGAGTCT 

AAAAATGGCATGCCTACCATCTCCCGGCTCTTAC^ 

CTAACCACTTCTGAAAAACCACAGT^ 

ATTGCCAAAGAATGTATAGAGAAGAGACTAATl^A aaaGAAAAAAAAGAAAG 
CGAAGACTGACAAGAGCTCAGTC^ 
ATTTTAGCTCGAAAGAAGTCAAAACGATCTGCT^^^ 
AAGTACAGCAACTCCAATAATTCAGCAGGATCTGGG^^ 

tcatcgccaactccaccctctacatcagggatatct^ 

CCACCACCACCAGCAGCTCCCTTGCCTCCT^ 

TCTCAGGCTGTGAATGGCATGAGCC^ 
AAAGGAACTTTGAGGAAAGCCAAACCTGTGATCA 

TCCTGTTTACACTTGGAGGGAAAAGTTCTTT^ 

TgS™^ 

TGGCATGCAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 90_AA425725 H GCGGCAGCGGCG GCAGTAGCAGCAGC 

ATGAGCGCCAGCACGGGCGGTGGTGGGGA^ 

TCACAGGCCTCCTGCGGGCCCGAGTCCTCGGGCTC^ 

CCTCAGATGCTGCAGGGCCTTCTGGGCTCCG^AC^ 

TACTGCAAGGGCGGCTACCACCCTGTGAAGAT^ 

GTGGTGCGCAAACTGGGCTGGGGCCACTT^^ 

CGCAAGCGCTTTGTGGCCCTCAAAGTGGTGA^GAGT^ 
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FIGURE 2TTT 



TGCATGGTGCTGGAGGTGCTGGGCCACCAGCTCCTCAAATGGATCATCAAGTCCAACTAC 
CAGGGCCTGCCCGTGCCCTGCGTGAAGAGCATCGTGAGGCAGGTGCTGCACGGCCTGGAC 
TACCTCCACACCAAGTGCAAGATCATCCACACGGACATCAAGCCCGAGAACATCTTGCTG 
TGTGTGGGGGACGCTTACATCAGGCGCCTGGCTGCCGAGGCCACGGAGTGGCAACAGGCA 
GGGGCGCCGCCCCCCTCCCGCTCCATAGTCAGCACTGCCCCCCAGGAGGTCTTGACCGGT 
AAGCTGTCCAAAAACAAGAGGAAGAAGATGAGGCGCAAACGGAAACAGCAGAAGCGGCTG 
CTGGAGGAGCGGCTGCGGGACCTGCAGAGGCTGGAGGCCATGGAGGCTGCCACCCAGGCT 
GAGGACTCTGGCTTGAGACTAGACGGGGGCAGCGGCTCCACATCCTCTTCAGGCTTCTCC 
GGCTCCCTCTTCTCTCCTGCCTCCTGCTCCATCCTCTCCGGCTCGTCC7VATCAGCGAGAG 
ACCGGGGGCCTCCTGTCGCCTAGCACACCATTCGGTGCCTCGAACCTCCTGGTGAACCCC 
CTGGAGCCCCAAAATGCAGATAAGATCAAGATCAAGATCGCAGACCTGGGCAACGCCTGC 
TGGGTGCACAAGCACTTCACGGAAGACATCCAGACTCGGCAGTACCGGGCCGTCGAGGTG 
CTGATCGGCGCCGAATACGGCCCCCCGGCAGACATCTGGAGCACAGCCTGCATGGCCTTC 
GAGCTGGCCACTGGTGACTACCTGTTCGAGCCGCATTCTGGAGAAGACTACAGTCGTGAT 
GAGGACCACATCGCTCACATAGTGGAGCTTCTGGGGGACATCCCCCCAGCCTTCGCCCTC 
TCAGGCCGCTATTCCCGGGAGTTCTTCAACCGGAGAGGAGAGCTGCGGCACATCCACAAT 
CTCAAGCACTGGGGCCTGTACGAGGTACTCATGGAAAAGTACGAGTGGCCCCTAGAGCAG 
GCCACACAGTTCAGCGCCTTTCTGCTGCCCATGATGGAGTACATCCCCGAAAAGCGGGCC 
AGTGCCGCTGACTGCCTCCAGCACCCCTGGCTCAACCCCTAG 

SEQ ID NO: 91_SGK022_H 

TCTGGCCCTGTCCCTCCCCACCACCCGCCGCTGTGTCCAGACAGAGAATGTTCTAACGCT 
GGGGGCGGCTGCGGATGAAGTCCTTGGGGAGAAAAGGAGCAGGCCAAGGGCGATGGTGGA 
GTAGAGCTGCCTCTCAGAGGCAGCATGAGCTGAGAGGGTGATAGGAAGGCGGCGCTAGAC 
AGCATGGAGGACTTTCTGCTCTCCAATGGGTACCAGCTGGGCAAGACCATTGGGGAAGGG 
ACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAACACCAAAGAAAAGTGGCAATTAAA 
GTTATAGACAAGATGGGAGGGCCATCAGAGTTTATCCAGAGATTCCTCCCTCGGGAGCTC 
CAAATCGTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCT 
GCCGACGGGAAAATCTGCCTGGTGATGGAGCTCGCTGAGGGAGGGGATGTCTTTGACTGC 
GTGCTGAATGGGGGGCCACTGCCTGAAAGCCGGGCC7VAGGCCCTCTTCCGTCAGATGGTT 
GAGGCCATCCGCTACTGCCATGGCTGTGGTGTGGCCCACCGGGACCTCAAATGTGAGAAC 
GCCTTGTTGCAGGGCTTCAACCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCC 
AAGTCACACCGGGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGCTGCCCCCGAG 
GTGCTGCAGGGCATTCCCCACGATAGCAAAAAAGGTGATGTCTGGAGCATGGGTGTGGTC 
CTGTATGTCATGCTCTGTGCCAGCCTACCTTTTGACGACACAGACATCCCCAAGATGCTG 
TGGCAGCAGCAGAAGGGGGTGTCCTTCCCCACTCATCTGAGCATCTCGGCCGATTGCCAG 
GACCTGCTCAAGAGGCTCCTGGAACCCGATATGATCCTCCGGCCTTCAATTGAAGAAGTT 
AGTTGGCATCCATGGCTAGCAAGCACTTGATAAAAGCAATGGCAAGTGCTCTCCAATAAA 
GTAGGGGGAGAAAGCAAA 

SEQ ID NO: 92_AA0 60026_M SGK022_M 

CAGACGGAGAATGTTCTAGCCCTGGAGGCAGCTGTGAATGAAGTCCTTGGGGGGAAAAGA 
AGCAGGCCGAGGGCGATGGTGGAGTAGAGCTGCCTCGCAGAGGCAGCATGAGCTGAGAGG 
GTGACAAGAAGGAGGCGCTACACAGCATGGAGGACTTTCTACTCTCCAATGGGTATCAGC 
TGGGCAAGACCATTGGGGAAGGGACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAAC 
ATCAAAGAAAAGTGGCAATTAAAATTATAGACAAGATGGGAGGGCCAGAAGAGTTTATCC 
AGAGATTCCTGCCTCGTGAGCTCCAGATTGTCCGTACCCTGGACCACAAAAACATCATCC 
AGGTGTATGAGATGCTGGAGTCAGCAGATGGT^AAAATCTACCTGGTGATGGAACTGGCTG 
AGGGAGGGGATGTCTTTGACTGTGTGCTGAACGGAGGGCCACTTCCCGAGAGCCGGGCCA 
AGGCCCTCTTCCGCCAGATGGTTGAGGCTATTCGCTATTGCCATGGCTGTGGCGTGGCCC 




WO 00/73469 



PCTAJS0O/14842 



FIGURE 2UUU 

ACCGGGACCTTAAGTGTGAGAACGCCTTGTTGCAGGGCTTCAACCTGAAGCTGACCGACT 
TTGGCTTTGCCAAGGTGCTACCCAAGTCACGCAGGGAGCTGAGCCAGACCTTCTGTGGCA 
GCACAGCCTATGCCGCCCCTGAGGTGCTACAGGGCATACCCCATGATAGCAAGAAAGGTG 
ATGTCTGGAGCATGGGTGTGGTCCTGTATGTAATGCTCTGTGCAAGTCTACCTTTTGATG 
ACACAGATATCCCCAAGATGCTGTGGCAGCAGCAGAAGGGGGTGTCCTTCCCCACTCATT 
TGGGCATCTCAACCGAATGCCAGGACCTGCTCAAGCGGCTCCTGGAACCAGACATGATAC 
TCCGGCCTTCAATCGAAGAAGTTAGTTGGCACCCATGGCTAGCAAGCACTTGATAAAAGC 

AATGGCAAGTCCTCCCCAATAAAGTAGGGGGAGAAAGCAAACTG 



3GGATTACAGGCGTGAGCCACCGCGCCCGGCCGCACTTCATTCTCAA 



SEQ ID NO: 93_AA3 99669_H 
CTCCCAAAGTGCTGGGATTACAGGC 

GTTTTGTGGCCAACGATGGATAGGAGGTGGATTGTGATGTATTCGGAACATGGGACCTTG 
AGGAGTTCCGTAACCAAAAGGAGAAAGTAACAACAGCCAGTGGAGACAAAAAGAACTGCT 
TCTCTTTCTTTCCCCCTCCAAGTTCCTAGTGGAGGGCTGAGTCCAGCATCCCAGACTCGT 
GTGACTATATAGGCAAGCATTTGGGGACCTACTTCACTTTGATACCCTAGCCTTCAGCAG 
CTCAAGGTGTTGGCCTTTGGATAGGAGGCTTCCAAGTAGTAAAGCTCCCTGCTCTCAGCA 
AGCCCAACACCATGGGGAAGGGAGATGTCTTAGAGGCAGCACCAACCACCACAGCCTACC 
ATTCCCTCATGGATGAATATGGTTATGAGGTGGGCAAGGCCATTGGCCATGGCTCCTATG 
GGTCGGTATATGAGGCTTTCTACACAAAGCAGAAGGTTATGGTGGCAGTCAAGATCATCT 
CAAAGAAGAAGGCCTCTGATGACTATCTTAACAAGTTCCTGCCCCGTGAAATACAGGTAA 
TGAAAGTCTTGCGGCACAAGTACCTCATCAACTTCTATCGGGCCATTGAGAGCACATCTC 
GAGTATACATCATTCTGGAACTGGCTCAGGGTGGTGATGTCCTTGAATGGATCCAGCGCT 
ACGGGGCCTGCTCTGAGCCCCTTGCTGGCAAGTGGTTCTCCCAGCTGACCCTGGGCATTG 
rCTACCTGCACAGCAAGAGCATCGTGCACCGGGACTTAAAGTTGGAGAACCTGTTGCTGG 
ACAAGTGGGAGAATGTGAAGATATCAGACTTTGGCTTTGCCAAGATGGTGCCTTCTAACC 
AGCCTGTGGGTTGTAGCCCTKCTTACCGCCAAGTGAACTGCTTTTCCCACCTCAGCCAGA 
CTTACTGTGGCAGCTTTGCTTACGCTTGCCCAGAGATCTTACGAGGCTTGCCCTACAACC 
CTTTCCTGTCTGACACCTGGAGCATGGGCGTCATCCTTTACACTCTAGTGGTCGCCCATC 
TGCCCTTTGATGACACCAATCTCAAAAAGCTGCTAAGAGAGACTCAGAAGGAGGTCACTT 
TCCCAGCTAACCATACCATCTCCCAGGAGTGCAAGGTCCAACTGCTCATTGCCTGTGTGG 
CACAATGGAGAAAAACTCAGGCAAGACCTCTCTCTCCCCTGCTCTAGAACCTGATCCTCC 
AGATGCTACGCCAAGCCACTAAGCGTGCCACCATTCTGGACATCATCAAGGATTCCTGGG 
TGCTCAAGTTCCAGCCTGAGCAACCCACCCATGAGATCAGGCTGCTTGAGGCCATGTGCC 
AGCTCCACAACACCACTAAACAGCACCAATCCTTGCAAATTACGACCTGAAAATGGCTGA 
GGGAGGGGGCTAAGAGAGGAGCAAAGCAGGAGGTCTTGGGCTAAAAATCTTTTTTACCAA 

AAATAAATCTAAGTCTGATTTAGTTTCATCAAAAAAA 
SEQ ID NO: 94 AA75853 9_H 

rArCATTCAGACGCCTCCGGTAGTGTAAATGAGGACAATGCCTGCTGGCCCACATGACGG 
GGGGATGTAGACGGCAGCGGCGCCAGTCGCTCCTGGCACCATGGACGATGCCACAGTCCT 
AAGGAAGAAGGGTTACATCGTAGGCATCAATCTTGGCAAGGGTTCCTACGCAAAAGTCAA 
ATCTGCCTACTCTGAGCGCCTCAAGTTCAATGTGGCTGTCAAGATCATCGACCGCAGGAA 
AACACCTACTGACTTTGTGGAGAGATTCCTTCCTCGGGAGATGGACATCCTGGCAACTGT 
CAACCACGGCTCCATCATCAAGACTTACGAGATCTTTGAGACCTCTGACGGACGGATCTA 
CATCATCATGGAGCTTGGCGTCCAGGGCGACCTCCTCGAGTTCATCAAGTGCCAGGGAGC 
CCTGCATGAGGACGTGGCACGCAAGATGTTCCGACAGCTCTCCTCCGCCGTCAAGTACTG 
CCACGACCTGGACATCGTCCACCGGGACCTCAAGTGCGAGAACCTTCTCCTCGACAAGGA 
CTTCAACATCAAGCTGTCTGACTTTGGCTTCTCCAAGCGCTGCCTGCGGGACAGCAATGG 
GCGCATCATCCTCAGCAAGACCTTCTGCGGGTCGGCAGCATATGCAGCCCCCGAGGTGCT 
GCAGAGCATCCCCTACCAGCCCAAGGTGTATGACATCTGGAGCCTGGGCGTGATCCTGTA 
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FIGURE 2VVV 

CATCATGGTCTGCGGCTCCATGCCCTATGACGACTCCGACATCAGGAAGATGCTGCGTAT 

cSgaIggIgcaccgtgtggacttcccgcgctccaagaacctgacctgcgact 

CC^CATCTACCGCATGCTGCAGCCCGACGTCAGCCAGCGGCTCCACATCGATGAGATCCT 

cagcSctcgtggctgcagccccccaagcccaaagccacgtcttctgcctccttcaaga^ 
^agSSagggcaagtaccgcgctgagtgcaaactggacaccaagacaggct 

rr agaacgag^caggatggaggacaggctggccgagacctccagggccaaagaccatca 
SggScg^^ 

ggtaggatctgaagaaggcacaggtgcaagtaaaattcgtcaattaaaccactattttga 

TT 

RTGTCGGGAGACAAACTTCTGAGCGAACT 

SgSg??ac^ca^gtgaaggtggc 

aarrTGGTGGACCGGCGGCGAGCGCCCCCGGACTTCGTCAACAAGTTCCTGCCGCGAGAG 
C^G?C?S22?GCGGGGCGTGCG 

SgSaIcgggaaactgtacatcgtgatggaagcggccgccaccgacctgc 
g^Sg^Sa?^^ 

gg?gc2g?gcgctacctgcacgatcatcacctggtgcaccgcgacctcaagtgcg^ 

™c?gSgagcccggacgagcgccgc 

?c?S?ggSacccLacctgagcaccacct^ 

gaggtcScSSggcatcccctacgac 

g?gS?Scgtcatggtcaccgggtgcatgcccttcgacgactcggacat 
cc?cggcgcSgaLcgcggcgtgctctatcccgaaggcc 

SgSc?gScg^cgagctgctgcagttcagcccgtccgccaggccctccgcgggccag 
gtagcgcgcaactgctggctgcgcgccggggactccggctag 

===== 

S?gS?gSSactaacgacaaaaata^ 

^arrTaPArACACTTAAGACCATTAAGAAAGCCAAGAAATAAGACCCAGACAAGGTGGGC 

™cS?2otac?tcctcagagttta^ 

rrTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGA 

cgggSIa^SgcSggtgatggagctcgctgagggaggg 

PAATGG^GGGCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTTGAG^ 

cScScSSScI?ggctgtggtgtggccc^ 

p^PP^qqqq^q^CCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCCAAGTC 

aScSggagS^ 



gcagggcattcccnncaagatgctgtggca 



GCAGCAGAAGGGGGTGTCCTTCCCCACTCA 



CAAGAGGCTCCTGGAACCCGATATGAT 



GCAATGGCAAGTGCTCTCCAATAAAGTAGGGGGAGAAAGCAAACCC 
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FIGURE 2 WWW 

CCTGGGTTATGCTGAGGAGCCCTGCTATCTCTGGTTTGTCAl^ GTTT CAT 

agIcctgaatcagtatgtcctgtcccggaggccaga^ 

AGAAAACCGAAAGATGGAGTTGCAGATC^^ 
TTTGGGTGATTTTAAACTAGGTCGA^CCTCGGGACCCACA 

ScCTTTTCTTTTCTTTTTTTOTTT^ 

TGTGGGCAAGGCATATGTGTA^TTTCACTTTTACTTTTTA ^ 
TTTTGGTTTTGTCCTTCACTTTCCCTCTGTCTTCCTTC1 TCTGGGGGT ANCT 
rrATGACACCTCACTTCCCTAGAGAAGGCCTGCCTCCC^^ 

gaggtgaagaaccgcc 

ssssssssss 

cgcagtggggccagggtggcagtcaagaagatccg^ 

ttggcactagcagaattctgggccctcaccagtctcaag^ 

cagtttgaggagtgcgtcctacagcgcaacgggttagcccagc^^ 

^gaactcacagctttacctgcgcctggtggagacct^ 

ggctatgctgaggagccctgctatctctggtttg^ 

ctcaatcagtatgtcctgtcccgga^ 

cagcttacaagcgccattgccttcc^ 

ccagacaacatcctgatcacagagcgg^^ 

ggactgagcaaggtctgtgcagggctggcacccc^ 

aaaaatgtgaatgtgaataa^^ 
gctcccgaagtctgggagggacactatacagcc^^ 

atcatctgggcaatgatagaaagaattacct™^ 
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FIGURE 2XXX 

AAGCAGCTCTTGAAAGACATGTTAGCTGCTAACCCACAGGACCGACCTGATGCTTTTGAA 
CTTGAAACCCGAATGGACCAGGTCACATGTGCTGCTTAAACTCCAGGGCTGAACGTCTTG 
GGTGTTTTTAAACTAGGTCGATCCTTCGGGACCCACAGTCTCATCGTGTCTCGGACAGGA 
TGGCAGAGGGTACAGGTGGTGGTGATCTCCTGACAGCTGGACCTCCCACAATGTGAAGCT 
CACGCTTGGGCTGCCCACTCTACCCTTCTCTTTCTCCTTCAGTAGAATAATAATTGTTTT 
TCTAAACATTAAACCATCAAGACTTCTGAAGAGCAGAAGGCTACACTCTG 

SEQ ID NO: 99_AA215311_H 

CGRCCGCGCTACGGAAAGCCGGAGGGGGGCGGGGCCGTCGGCGTAAGGGGGTGTGTCCGC 
GCGCACCACGGGGGCGCGCGCCGGCTGCTGACTGGAGGCGGCGGCAGCGGAGGCGCGAGC 
TGCCCGATAATGGCGGCCTGCAGAGCCCATGAGAGGGAGAAGCGGCAGCGTCTACCCTGA 
GAAACCTCGACCTTGAAGATGGTGAGTAGCCAGCCAAAGTACGATCTAATACGGGAGGTA 
GGCCGAGGTAGTTACGGTGTTGTGTATGAAGCAGTCATCAGAAAGACCTCTGCACGGGTG 
GCAGTGAAGAAAATTCGATGTCACGCACCTGAAAATGTTGAACTAGCCCTTCGTGAGTTC 
TGGGCACTAAGCAGTATCAAGAGCCAACATCCAAATGTGATTCACTTGGAGGAATGCATC 
CTACAAAAGGATGGGATGGTGCAAAAGATGTCCCACGGCTCTAATTCTTCCCTTTATTTA 
CAGCTTGTAGAAACTTCATTAAAAGGAGAAATTGCCTTTGATCCCAGAAGCGCCTATTAT 
TTGTGGTTTGTGATGGATTTTTGTGACGGAGGAGATATGAATGAGTATCTGTTGTCCAGG 
AAGCCCAATCGTAAAACTAACACCAGCTTCATGCTTCAGCTGAGCAGTGCCCTGGCTTTC 
TTGCATAAAAACCAGATCATCCACCGAGATCTTAAGCCTGATAACATCCTGATTTCTCAA 
ACCAGGTTGGATACCAGTGACTTGGAACCTACCCTCAAAGTGGCTGATTTTGGTCTAAGT 
AAAGTTTGTTCAGCCTCTGGGCAGAACCCAGAAGAACCTGTCAGTGTAAACAAGTGTTTC 
CTTTCCACAGCATGTGGAACAGATTTTTACATGGCTCCTGAAGTTTGGGAAGGACATTAC 
ACAGCAAAAGCTGACATCTTTGCTCTGGGGATTATCATCTGGGCAATGCTGGAAAGGATC 
ACATTCATAGACACAGAGACAAAGAAGGAACTCTTGGGGAGTTATGTAAAACAAGGAACT 
GAGATTGTGCCTGTTGGGGAGGCACTTCTGGAAAATCCCAAAATGGAACTTCTCATTCCT 
GTGAAGAAAAAATCTATGAATGGGCGAATGAAACAACTGATTAAGGAAATGCTGGCTGCA 
AACCCTCAGGATCGTCCAGATGCTTTTGAACTAGAACTCAGATTAGTACAAATTGCATTT 
AAAGATAGCAGCTGGGAAACGTGACACATATTATTTGCAAATACCATGGATGATATGCTG 
CTTCTGTTTAACAGTGATGCAACATTATGTGGCTGAAAAAGAATATAAAAAGCTAGACTC 
TACCCTCTAAGGGTTTAGATTTTTTGTGGGATTTTTTTTTTCCTCATTTTTCTTAAATCC 
AAGTTGGCCGTTTTATTAGTATGTTTCAAATGTGTATTACCAATGTGGGTGTAAATTTTT 
AAAAAATGATTATTGATAGAAGTTTGGCAGGAAAATTCTTTAAGAGCTAACAAGAGAAGA 
GAGTCCAGTTTTCTGGAAATATGTCTTTAAGTATTTTAGACATTCCTCGTCAGTATTAGG 
AATTTCCATGGGAAAAGAGGTTTGCATGCTGGTAATGCAACCTTTGAAACTTTGTAAAGG 
AAACATATATGTATATATTTATGTATATGTAAGTATGTGAATGTGCGCATTTTGCATTCC 
ATATGAAAAAAATGCCACGTCTGTTTAAATTATTTGATGTAGGTTTGGGTTTTTGAGATT 
TGCTGGTGAAGTCAGTGACGAAAAATAAACCTTCCCTTATCTTCCTACTCTGCCCCTCCC 
CCTAATGAAATCATATTAAGTNGTTTTTCCTNNTTTTTTTGTAATATACAGCTTTTTTTT 
TAAGGCATCATTTTCGAGGGTCTAAAATTATCTGGTAAAACAAATGAAATTAAGTGATCC 
AAAGCTGCTGAAGTATGTTTGAACTCTCCAGTGCCCTATAGCTGCAAGAGTTGAATTAGT 

CATGCAGTCATATGGCAGCAGGTTGGTGATT 
SEQ ID NO: 100_AA018361_H 

GCGGGGCCTCCGTATCCCCACGTGGGCCCTGCAGGAACTGGCGGGGCGCGTGACCCGGCG 
AGGCCCAGAGACAGGGGAGGGGCGCCGGGAGCCGGGCGGATCCGCGTCCCCGATGCGCGC 
TGCATTTCCGGCGGGCGGCGCTGGGGGCAGCGTGGAGCCACCCAGTGCTCGGCCCGCCCC 
GCAACCCGCCGGAACCGCCGCCCGCAGCGAGGAAGCGCCCGCGCGGGCGCAGGCGGCCGG 
AATGGCGGGGCCCGGCTGGGGTCCCCCGCGCCTGGACGGCTTCATCCTCACCGAGCGCCT 
GGGCAGCGGCACGTACGCCACGGTGTACAAGGCCTACGCCAAGAAGGACACTCGTGAAGT 
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FIGURE 2YYY 

GGTAGCCATAAAGTGTGTAGCCAAGAAAAGTCTGAACAAGGCATCGGTGGAGAACCTCCT 
CACGGAGATTGAGATCCTCAAGGGCATTCGACATCCCCACATTGTGCAGCTGAAAGACTT 
TCAGTGGGACAGTGACAATATCTACCTCATCATGGAGTTTTGCGCAGGGGGCGACCTGTC 
TCGCTTCATCCATACCCGCAGGATTCTGCCTGAGAAGGTGGCGCGTGTCTTCATGCAGCA 
ATTAGCTAGCGCCCTGCAATTCCTGCATGAACGGAATATCTCTCACCTGGATCTGAAGCC 
ACAGAACATTCTACTGAGCTCCTTGGAGAAGCCCCACCTAAAACTGGCAGACTTTGGTTT 
CGCACAACACATGTCCCCGTGGGATGAGAAGCACGTGCTCCGTGGCTCCCCCCTCTACAT 
GGCCCCCGAGATGGTGTGCCAGCGGCAGTATGACGCCCGCGTGGACCTCTGGTCCATGGG 
GGTCATCCTGTATGAAGCCCTCTTCGGGCAGCCCCCCTTTGCCTCCAGGTCGTTCTCGGA 
GCTGGAAGAGAAGATCCGTAGCAACCGGGTCATCGAGCTCCCCTTGCGGCCCCTGCTCTC 
CCGAGACTGCCGGGACCTACTGCAGCGGCTCCTGGAGCGGGACCCCAGCCGTCGCATCTC 
CTTCCAGGACTTCTTTGCGCACCCCTGGGTGGACCTGGAGCACATGCCCAGTGGGGAGAG 
TCTGGGGCGAGCAACCGCCCTGGTGGTGCAGGCTGTGAAGAAAGACCAGGAGGGGGATTC 
AGCAGCCGCCTTATCACTCTACTGCAAGGCTCTGGACTTCTTTGTACCTGCCCTGCACTA 
TGAAGTGGATGCCCAGCGGAAGGAGGCAATTAAGGCAAAGGTGGGGCAGTACGTGTCCCG 
GGCTGAGGAGCTCAAGGCCATCGTCTCCTCTTCCAATCAGGCCCTGCTGAGGCAGGGGAC 
CTCTGCCCGAGACCTGCTCAGAGAGATGGCCCGGGACAAGCCACGCCTCCTAGCTGCCCT 
GGAAGTGGCTTCAGCTGCCATGGCCAAGGAGGAGGCCGCCGGCGGGGAGCAGGATGCCCT 
GGACCTGTACCAGCACAGCCTGGGGGAGCTACTGCTGTTGCTGCGGAGCCCCCGGGCCGG 
AGGCGGGAGCTGCTTCACACTGAGGTTCAGAACCTCATGGCCCGAGCTGAATACTTGAAG 
GAGCAGATGAGGGAATCTCGCTGGGAAGCTGACACCCTGGACAAAGAGGGACTGTCGGAA 
TCTGTTCGTAGCTCTTGCACCCTTCAGTGACCCTAGAAGAATGATTGGACAGATGTGAGC 
CATCTGGAGCAGAGGGGCACTAACCCAGGCTGACGCCAAGAATGAAGTGGCCCACTGCAG 
CCCTGGCGAGCAGGCTTCTTGGATGGACAGTGCTGAGACCCCCATATCCCAGAGTCCCCA 
GCCTCCCTCAGGTTACTCTGCACCCCACAGATGGTTTGATGGCTGTGCTGTATACTGGAG 
GGGAGGGCAGGACTCTGGGAGAACAGCACTTCTTTCATGAGACCTTTGTTACTCGGTGGT 
TACTGGGTCCTGTGCCTGTCCGTTTTGGGGCATGCAGCCCTCTATCATTTTTGGCTCCGA 
GAAGAGGGCAAGGGGCCCCCGCAGGGTACTTCTGTGCTTGCCCTCGCCCTGCCAGCAGGC 
AGCTGTGCCCCTGGCCTGCCCTTCCCGGGACCCCTTATTCCAACTCAGCTCCTCTTTGCA 
CTGGAATGGGGCACTCCAACACCCCTCAGGGACCACCCTCCCCACAGTATGCACTCAGCC 
CCACAGAACCCACCAGTCTTTCTGGGAACTCACACCTGCCCGCCATCTTGGTACTTTAGG 
TTAATCCCTCAAGCATGAAAGCTGGATCTTTTGGGGTTTAAGAAGCCCAAGCCTTGTTCC 
TGCCCTGGCCTAGGGAGCACTCAGGAGGGTTCCTTGGTCCTCATCTCTCCCACCTCCGTT 
CCCTCTGGGCCCCACACTAGCCACAGCGCGGGCCTTGTGCTGGAGTTTGAGCCTGGGACA 
GGGAGAGGGAGGCTTGGAGACAGTCTGACCCAGTGCCCTCTAGGCCACCCACTTCTAGGC 
CTGCCCTGCCGCCGTGGAGCCCTGGGCAAGCTCTTTCCCCTTTCTGGGCCTGGGTCTCCC 
CATCTCTTCAATGGGGCTGATACCTTCACAGCCCACAGCATGGGCACTTATGAGGACAAA 
GTGAATTTAACCTGGAAAAGAATGTATTTGAGAGTTTCTTTTAAATAATCAGCGGGTGTT 
GGTGATTTGTAGCCCTTCTGCCCTTAAATGCTTCCTTGGGCAAGAGCTGTCTGTCCTCCC 
TGCAGGAGGCTGAGTGTGAAGAGTATCATTCATTGTTTCTCTATTAAATTATTTTCTCT 

SEQ ID NO: 101_AA311714_H 

TGGACCTGTCCTGAGGCAGAGGCCGAGATGCGCGCAACCGCGGGAGCAGCCAAGTGGACT 
GGACTCTTTTCTTGACTTAGCTACCAGGAGCTAGAGATGCTGTTATTCTATCGTATGTGA 
GAAGTCGGCCCAGAGATGGAAAACTTTATTCTGTATGAGGAGATCGGAAGAGGAAGCAAG 
ACTGTTGTCTATAAAGGGCGACGGAAGGGAACAATCAATTTTGTAGCCATTCTTTGTACT 
GATAAGTGCAGAAGGCCTGAAATAACCAACTGGGTCCGTCTCACCCGTGAAATAAAACAC 
AAGAATATTGTAACTTTTCATGAATGGTATGAAACAAGCAACCACCTCTGGCTAGTGXAT 
GAAAACCTCCCAGAAGATGTTGTGAGAGAATTTGGAATTGACCTGATTAGTGGATTACAT 
CATCTTCATAAACTTGGCATTCTCTTTTGTGACATTTCTCCTAGGAAGATACTCTTGGAA 
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GGGCCTGGCACACTGAAGTTTAGCAACTTTTGCTTGGCAAAAGTGGAAGGTGAAAATTTG 
GAAGAGTTCTTTGCTTTGGTGGCAGCAGAGGAAGGAGGAGGTGATAATGGGGAAAATGTC 
CTGAAGAAAAGCATGAAAAGTAGAGTCAAAGGATCTCCTGTATATACAGCACCAGAAGTT 
GTGAGGGGTGCTGACTTTTCCATCTCCAGTGACCTCTGGTCTTTGGGCTGTCTGCTTTAT 
GAAATGTTTTCAGGAAAACCTCCATTCTTCTCAGAAAGTGTTTCAGAATTAACTGAAAAG 
ATCTTATGTGAAGATCCTTTGCCACCTATTCCGAAAGATTCTTCTCGTCCTAT^AGCTTCT 
TCAGATTTTATTAATTTGCTTGATGGGTTACTTCAAAGAGATCCTCAGAAAAGATTGACT 
TGGACAAGGCTACTGCAGCATTCATTTTGGAAGAAAGCTTTTGCTGGAGCAGATCAGGAA 
TCAAGCGTCGAAGATCTCAGTCTCAGCAGAAACACTATGGAGTGTTCTGGGCCACAAGAT 
TCCAAGGAGCTTTTGCAGAACTCTCAGAGTAGACAAGCAAAAGGGCACAAGAGTGGTCAA 
CCACTAGGTCACTCTTTCAGACTAGAAAATCCAACTGAGTTTCGGCCTAAGAGTACTCTT 
GAGGGTCAATTGAATGAATCCATGTTTCTTCTCAGTTCTCGTCCTACTCCCAGAACTAGC 
ACTGCAGTGGAAGTAAGTCCTGGTGAGGATATGACTCACTGTTCACCACAGAAGACTTCT 
CCTCTGACCAAGATTACAAGTGGACACCTGAGTCAGCAGGACCTGGAATCCCAGATGAGA 
GAGCTTATCTACACGGACTCAGATCTTGTTGTCACCCCCATTATCGACAATCCAAAGATA 
ATGAAACAGCCACCAGTTAAATTTGATGCAAAAATATTGCATCTACCAACATATTCAGTG 
GATAAGTTATTATTTCTGAAAGATCAAGATTGGAATGACTTTTTGCAACAAGTGTGCTCG 
CAGATCGACTCCACTGAGAAGAGCATGGGGGCCTCCCGAGCCAAGCTGAATCTCCTTTGC 
TATTTGTGCGTGGTGGCTGGTCACCAGGAGGTGGCCACCAGGCTCCTCCATTCCCCCCTG 
TTCCAATTGCTAATCCAGCATTTGCGGATAGCTCCT^AACTGGGATATACGGGCCAAGGTT 
GCTCACGTGATTGGTTTACTGGCTTCGCACACAACTGAGCTCCAGGAAAATACACCTGTT 
GTTGAGACTACAAGCTCCATTGGAATCGGGATTTTGAACTGTCTTGTTCAACACTCCACT 
CCAGTGCCTAGACAGTGCCTTGTGTATGTATAGATACTGACAAATATTTCAAAATAAATA 
AAACTGTATCAGCATT 

SEQ ID NO: 102_SGK3 84_H 

TCTTTGGCCCACGTGCTGAGGGCGCGGCAGATCCTGACGGAGCCAGAAGTGCGCGACTAC 
CTGCGGGGCCTGGTCAGCGGCCTGCGCTACCTGCACCAGCGGTGCATCCTGCACCGC 

SEQ ID NO: 103_AA2104 51__M SGK3 84_M 

GGTCTGCTGCATGGATAATGGACTGGAACACAGAAAGACCATGCAGGGTTCGGCTGTAGA 
AGGCCAGTATCTCCAGAGGCCAGAAGACACCATCAGATCTCCTGGGACTGGAGTTATAGA 
GGTTGTGAGCTGCCATGTTGAACCAAGCAGGTCACTGAGGGACACAGGCATGTGGATGGA 
AACCCTGCTGGGAGAAAAAAGAAACTGCTGAAGGGACTGACATGGGACAGCAACATGGAA 
CCAGGAATGGTCTCACGCATAGAGAGCTCCCCCGGGGCGTGGGGCTGCTGCTCGCCATGG 
CCCTTATGAACGTGGCGCTCTACCTCTGCCTTGATCAGCTTTTCATCTCCCCTGGACGAT 
CCACCGCGGACTCTAGGCGCTGTCCTCCGGGCTACTTCAGAATGGGGCGGATGAGAAACT 
GCTCACGCTGGCTGTCCTGTGAAGAGCTGAGGACAGAAGTCAGGCAGCTGAAGCGCGTTG 
GGGAGGGAGCCGTGAAGAGAGTCTTTCTGTCTGAATGGAAGGAACACAAAGTCGCTCTCT 
CCCGGCTCACCAGGCTGGAGATGAAGGAGGACTTCCTGCATGGGCTGCAGATGCTGAAGT 
CTCTACAGAGTGAGCACGTGGTCACGCTGGTGGGCTACTGTGAGGAAGATGGCACTATTC 
TCACCGAATATCACCCCTTAGGTTCCTTGAGCAACCTGGAAGAAACACTAAACCTTTCAA 
AGTACCAAGACGTGAACACTTGGCAGCACAGGCTGCAGCTGGCCATGGAGTACGTCAGCA 
TCATTAACTATCTGCATCACAGCCCCCTGGGCACGAGGGTCATGTGTGACTCTAACGACC 
TGCCCAAAACATTGTCCCAGTACCTGCTAACAAGTAACTTCAGCATTGTGGCAAACGACC 
TGGACGCTCTGCCCCTGGTAGACCATGACTCTGGGGTACTTATAAAGTGTGGCCACAGAG 
AGCTCCATGGGGATTTTGTGGCTCCAGAGCAGCTGTGGCCCTACGGAGAAGACACGCCCT 
TCCAAGACGATCTCATGCCTTCCTACAATGAGAAGGTTGACATCTGGAAGATTCCAGATG 
TCTCCAGTTTCCTCTTGGGGCACGTGGAAGGGAGTGATATGGTTAGATTCCATTTGTTTG 
ATATCCATAAGGCGTGCAAGAGCCAGATCCCGGCAGAAAGACCCACTGCTCAGAACGTGC 
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TAGACGCTTACCAGAGGGTTTTCCATTCACTCCGAGACACTGTGATGTCGCAGACGAAAG 
AAATGCTGTAAAAATGAGCCATCGAGTGACGTGCTTGATGGCTGAATGGCATCCCAGCTG 
TTCCGCTCTTGATGATGGAAGAGCTTTGCATGGATGGATGTTGACCCTGGCTGTTCAGCC 
ACGTAGGCCTCCTCTACGTCTGCCTGCATGTTTGAGTGTTCTGCTCTCCTGGCAGCCCGG 
ATGGAAGCTGCCAAGCGAGAAAGCCTGGCTTCAGGATGCTCCCTGGTGAAGATGCAGAGG 
ATTCTGGATCTGCATAGTTTCAAGGGAGTGATCAAACGGTGACCTTGAAGACATGCTGCC 
TGCCTTGGTAACTTTTTATAGACTAGTAGGAAACAGAAATCTTTTGGGGGAGGGGGGGAC 
AACCCACTAGTTCCTCAGAGACAATTTCTTCTCATTCAGAAAGCCCTGTTGGAAGCTGGG 
GATGTTTTAACTCCGTGGCAGGGCACTTGCCTAGTTGTGTGCAAAGCCTTGGATCTGACC 
CATGGCATGTGCACACACACAAATGCTCAAAGAAT^ATCCCAGACGCCAGAAGTGTGCCCC 
TTTCTTGTCAATAAGGTCATTGTCAGTACCGGAGATGATTTTTTTTATGAAGCGTTTATG 
CTGACTCGTGTCACTGAGCCAAGTGTGCATGGTCGTTAGCTACTTTGTGGGTTCTTCTTT 
CTTTCTACCCTACTTCTTCCCTTTCCACCCCCTT^ACACTAGATAGGAGAGAGGAGAGAGA 
AAGGAAAGTGGGCACTGTTATATTGTTGGACGACTTCTTGCTGATTAAGGGGTGTCGAGT 
TCCTTGGAGCAATGATCTTTGCTGCCAAGATATCTCATTTCTTCTTGTTTCTTCTTCGCC 
CACGACCACTTCACAAACACCGACCAACAGCAAACAACAACCCACCCCGCTTCTCGGGGG 
CCCTAGCACTTATGTACTTCTGAAAAGTCCCCAGAAATTCCAATCATCACACACTCAGAG 
AAACTGTCTGCTGCTGGCAAAACTACACCCCTGCTAGAGCATGAGGCAAATCATAGTCAG 
CTGCTGTGGACAGTCTGAAGCAGCCTGGCATCCCACACCTGAGATTAAAACAAAAACATT 
CTTACCTGTGTTTTGTTTTTGTTTTAAGAAACCAAAGTGCACCAAGATAGCATGCTCTTG 
AGATTGTGGCTGTCTAGAGATTTTTGGAACAGCAAGTTGAAGGAACTTTCTTACCTGCCT 
TGAATGGTGCTTTGAACTTCCTGCTGACCTGGAGTTTCTGTGTGAATATTTCTATCCAGT 
GTCCCCCTGTACCGGAAAGTACAAAGTCTGCTCTGGGCTTGCATGCCTGAACACTTTAAA 
ACACTGTGGAGCCAGGAATAATGGTACCCACCTGTAATCCCAGCACCTGGGAGACAGGAG 
GAACCAGGAGTTCAGGGTTATCCTGGGCTATATACCGTGACCCTGTCTACCCCCACACCC 
CAATAAAAAAACAAAAAGGTC 

SEQ ID NO: 1 04_SGK071_2_H 

GAGGTGGTGGCTGTGCAGATGATGGTGGAATGCATGGATGACCATTACGCCAGTCAGGCC 
CTGGAGGAGCTGATGCCACTGCTGAAGCTGCGGCACGCCCACATCTCTGTGTACCAGGAG 
CTGTTCATCACGTGGAATGGGGAGATCTCTTCTCTGTACCTCTGCCTGGTGATGGAGTTC 
AATGAGCTCAGCTTCCAGGAGGTCATTGAGGATAAGAGGAAGGCAAAGAAAATCATTGAC 
TCTGAGTGGATGCAGAATGTGCTGGGCCAGGTGCTGGACGCGCTGGAATACCTGCACCAT 
TTGGACATCATCCACAGGAATCTCAAACCCTCCAACATCATCCTCATCAGCAGTGACCAC 
TGCAAACTGCAGGACCTGAGTTCCAATGTGCTAATGACAGACAAAGCCAAATGGAATATT 
CGTGCGGAGGAAGACCCCTTTCGTAAGTCCTGGATGGCCCCTGAAGCCCTCAACTTCTCC 
TTCAGCCAGAAATCAGACATCTGGTCCCTGGGCTGCATCATTCTGGACATGACCAGCTGC 
TCCTTCATGGATGGCACAGAAGCCATGCATCTGCGGAAGTCCCTCCGCCAGAGCCCAGGC 
AGCCTGAAGGCCGTCCTGAAGACAATGGAGGAGAAGCAGATCCCGGATGTGGAAACCTTC 
AGGAATCTTCTGCCCTTGATGCTCCAGATCGACCCCTCGGATCGAATAACGATAAAGGAC 
GTGGTGCACATCACCTTCTTGAGAGGCTCCTTCAAGTCCTCGTGCGTCTCTCTGACCCTG 
CACCGGCAGATGGTGCCTGCGTCCATCACCGACATGCTGTTAGAAGGCAACGTGGCCAGC 
ATTTTAGGTGATGCTGGGGACACAAAGGGGGAGCGTGCCCTGAAGCTCCTGTCCATGGCC 
TTGGCATCCTATTGTTTAGTTCCAGAGGGTTCATTATTTATGCCCCTGGCCTTGCTCCAC 
ATGCACGACCAGTGGCTCAGCTGTGACCAGGACAGAGTCCCTGGGAAGAGAGACTTTGCC 
TCCCTGGGGAAACTAGGGAAGCTGTTGGGCCCCATCCCAAAGGGTCTGCCGTGGCCCCCG 
GAGCTGGTGGAGGTGGTGGTCACGACCATGGAGCTACATGACAGGGTCCTCGATGTCCAG 
CTGTGTGCCTGCTCCCTGCTGCTGCACCTCCTGGGCCAAGCGCTGGTGCACCACCCGGAA 
GCCAAGGCTCCCTGCAACCAAGCCATCACCTCCACCCTGCTGAGTGCTCTTCAGAGCCAC 
CCCGAGGAGGAGCCACTTCTTGTCATGGTCTACAGCCTGCTAGCCATCACCACAACCCAG 



H/li) 
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GAGTCAGAGTCACTGTCAGAGGAGCTGCAGAACGCTGGGCTGCTGGAGCACATCCTGGAG 
CACCTCAACAGCTCCCTCGAAAGCAGGGACGTCTGCGCCAGCGGCCTGGGCCTGCTCTGG 
GCCCTCCTGCTGGACGACCCCATCTTGGCACTCCAGCGCCCCAGGAAAAAGAGAGCTCCA 
AACCACGGAAAGCCCGGGAAACCCAAGAACCCTGCCAGCACCCAAAGTATCATTGTGAAC 
AAGGCCCCCTTGGAGAAGGTCCCGGACCTCATCAGCCAGGTGTTGGCCACCTACCCTGCG 
GATGGGGAAATGGCAGAAGCCAGCTGCGGAGTCTTCTGGCTGCTGTCCCTGCTGGGCTGC 
ATCAAGGAGCAGCAGTTTGAACAAGTGGTGGCGCTGCTCCTGCAAAGCATCCGGCTGTGC 
CAGGACAGAGCCCTGCTGGTGAACAATGCCTACCGGGGACTGGCCAGCCTGGTGAAGGTG 
TCAGAGCTGGCGGCCTTCAAGGTGGTGGTGCAGGAGGAGGGCGGCAGTGGCCTCAGCCTC 
ATCAAGGAGACCTACCAGCTCCACAGGGACGACCCGGAGGTGGTGGAGAACGTGGGCATG 
CTGCTGGTCCACCTGGCTTCCTATGAGGAGATCCTGCCGGAGCTGGTGTCCAGTAGTATG 
AAGGCCCTGCTCCAGGAGATCAAGGAGCGCTTCACCTCCAGCCTGGTGAGTGACAGCAGC 
GCCTTCAGCAAACCAGGCCTCCCTCCAGGTGGAAGCCCCCAGCTGGGGTGCACCACGTCT 

GGGGGACTGGAATAG 

SEQ ID NO: 105_AA118352_M SGK071_M 

CAGAAGAAGACCCCTGCCAGAAGTCCTGGATGGCTCCTGAAGCTCTCAAATTCTCCTTCT 
CCACCAAATCCGACATCTGGTCTCTGGGCTGCATCATTCTAGACATGGCCACTTGCTCCT 
TCCTGAACGACACAGAAGCCATGCAACTGCGGAAGGCCATCCGCCATCATCCAGGCAGCC 
TGAAGCCCATCCTGAAAACCATGGAGGAGAAGCAAATCCCTGGTACAGATGTCTACTATT 
TGCTTCTGCCCTTCATGTTGCATATCAACCCCTCCGATCGACTGGCAATCAAGGATGTGA 
TGCAAGTCACCTTCATGAGCAACTCCTTCAAAAGCTCCTCTGTTGCGCTGAATATGCAGC 
GGCAGAAGGTCCCCATCTTCATCACTGACGTGCTGCTTGAAGGCAACATGGCCAACATCT 
TAGGCAGCTGGCTGTGTGCTTCCTTTGTGAACGACAGCAGGCACTGTGACTCAGGGATTG 
GCTCGCAGAGACTTGGGTTTGATTTTCAGTCAGTCTCTTGGACAGAGCACCCTCTGAAAG 
ATGTCATGCAGAATTTCTCCAGTCGACCAGAGGTCCAGCTCAGAGCCATTAACAAGTTGT 
TGACAATGCCAGAGGACCAGCTAGGGCTGCCATGGCCCACAGAGCTGCTGGAAGAGGTGA 
TCAGCATCATAAAGCAGCATGGGCGGATCCTGGATATTCTGCTCAGCACCTGCTCCCTTC 
TGCTGCGTGTTCTTGGCCAAGCACTGGCAAAGGACCCAGAAGCTGAGATCCCAAGGAGCA 
GTTTGATCATCTCCTTCCTGATGGATACCTTGCGGAGCCATCCTAACTCTGAAAGGCTTG 
TTAATGTGGTCTACAACGTGCTTGCCATTATTTCCAGCCAAGGACAGATCTCAGAAGAGC 
TGGAAGAGGAGGGGTTGTTTCAGCTTGCCCAAGAGAACCTGGAGCACTTCCAAGAGGACA 
GGGACATCTGCCTCTCTATCCTGAGCCTGCTCTGGTCCCTCCTGGTAGATGTTGTCACTG 
TGGACAAAGAGCCCTTGGAGCAGCTCTCTGGCATGGTCACCTGGGTGCTGGCTACTCATC 
CGGAGGACGTGGAAATAGCAGAGGCTGGCTGTGCGGTGCTCTGGCTGCTGTCCTTGTTGG 
GCTGCATAAAGGAGAGTCAGTTTGAGCAGGTGGTAGTGCTGCTCCTGAGAAGCATCCAGC 
TGTGCCCTGGCAGAGTACTGCTGGTGAACAATGCATTCCGTGGCTTGGCCAGCCTCGCAA 
AGGTGTCCGAACTGGTGGCCTTCCGAATAGTAGTACTGGAAGAGGGCAGCAGCGGCCTCC 
ACCTCATCCAAGATATCTACAAGCTCTACAAGGATGACCCTGAGGTGGTGGAGAACCTCT 
GCATGCTGTTGGCCCATCTGACCTCCTACAAGGAGATCCTGCCAGAGATGGAGTCTGGAG 
GCATCAAAGACCTAGTCCAGGTGATCCGGGGGCGCTTTACCTCCAGCCTGGAGCTGATTT 
CTTACGCTGATGAGATACTCCAGGTACTGGAAGCAAATGCACAACCTGGCGTCCAGGAGG 
ATCAGCTTGAGCCTCCTGCAGGGCAGGAAGCCCCACTGCAGGGAGAGCCCCTCTTCAGGC 
CCTGACATGCTGCCCTTCTGGTCCTGTGGTAAGAGAAAGTATCACTAGGTCCAGTATTAA 
TTTCGTACCCCATGGTGACTAATAAAAGAAGCCCTAGGCTGTTTCTGGC 

SEQ ID NO: 106_018653 . 9_H 

GGCCGGGGTCGGGGCGCGGGGCATGCGCGCGGGCTGGGCAGGGGGCCGGCGGGGCGCAGA 
GCGGAGCCGCCTCGGAGCCTGAGCCGCCCGGGGCCGGGGCCGGGGAGCCGCGCGGGGCCG 
GCCGGCCGGGGGGAGGGGAGCGATGCGGCGCCGGCGGGCGGCAGTGGCCGCGGGTTTCTG 
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CGCCTCCTTCCTGCTGGGCTCCGTCCTCAACGTGCTCTTCGCTCCGGGTCGGAGCCTCCG 

AGGCCAGGCCAGTCCCCTGAGCCTTCGCCGGCCCCGGGTGCGGGCCGTCGCGGGGGCCGC 

GGGGAGCTGGCCCGGCAGATCCGGGCGCGCTACGAGGAGGTGCAGCGCTATTCCCGCGGG 

GGCCCCGGGCCCGGGGCGGGCCGGCCGGAGCGGCGGCGCCTGATGGACCTGGCTCCGGGC 

GGGCCCGGCCTGCCGCGCCCCCGGCCCCCTTGGGCCCGGCCCCTGTCCGACGGCGCCCCA 

GGCTGGCCCCCGGCTCCCGGCCCAGGCTCCCCCGGCCCGGGCCCGCGCCTGGGCTGCGCC 

GCGCTTCGCAACGTGTCCGGCGCGCAGTACATGGGCTCAGGCTACACCAAGGCCGTGTAC 

CGGGTCCGCCTGCCCGGCGGTGCCGCGGTGGCGCTCAAGGCGGTGGACTTTAGCGGCCAC 

GATCTGGGCAGCTGCGTGCGCGAGTTCGGGGTACGGAGGGGCTGCTATCGGCTGGCGGCC 

CACAAGCTGCTTAAGGAGATGGTGCTGCTGGAGCGGCTGCGGCACCCCAACGTGCTGCAG 

CTCTATGGCTACTGCTACCAGGACAGCGAGGACATCCCAGACACCCTGACCACCATCACG 

GAGCTGGGCGCCCCTGTAGAAATGATCCAGCTGCTGCAAACTTCCTGGGAGGATCGATTC 

CGAATCTGCCTGAGCCTGGGCCGCCTCCTCCACCACCTGGCCCACTCCCCACTGGGCTCC 

GTCACTCTGCTGGACTTCCGCCCTCGGCAGTTTGTGCTGGTGGATGGGGAGCTCAAAGTG 

ACGGACCTGGATGACGCACGTGTGGAGGAGACGCCGTGTGCAGGCAGCACCGACTGCATA 

CTCGAGTTTCCGGCCAGGAACTTCACCCTGCCCTGCTCAGCCCAGGGCTGGTGCGAGGGC 

ATGAACGAGAAGCGGAACCTCTATAATGCCTACAGGTTTTTCTTCACATACCTCCTGCCT 

CACAGTGCCCCGCCTTCACTGCGTCCTCTGCTGGACAGCATCGTCAACGCCACAGGAGAG 

CTCGCCTGGGGGGTGGACGAGACCCTGGCCCAGCTGGAGAAGGTGCTGCACCTGTACCGG 

AGCGGGCAGTATCTGCAGAACTCCACGGCAAGCAGCAGTACCGAGTACCAGTGTATCCCA 

GACAGCACCATCCCCCAGGAAGACTACCGCTGCTGGCCATCCTACCACCACGGGAGCTGC 

CTCCTTTCAGTGTTCAACCTGGCTGAGGCTGTGGATGTCTGTGAGAGCCATGCCCAGTGT 

CGGGCCTTTGTGGTCACCAACCAGACCACCTGGACAGGTCGGCAGCTGGTCTTTTTCAAG 

ACTGGATGGAGCCAAGTGGTCCCTGATCCCAACAAGACCACATATGTGAAGGCCTCTGGC 

TGACCTATCTGAGGGCTCGGCTGACCAGCTGACTATCCTCAGCAGCTGGGCTTGCCTGTG 

GAGGGAGTGACTTGCACTGGCAGCACTGCATGTCACCTGGGAACCCCTGCAGACAAAGCT 

AACATCCCAGACAGACAGATGTGACCAGGACAAACGTGCAATAATGCCAAATGTTAAAAT 

GTGAGTTTACCAGCCTAGCTATGGGACTGCTGGCTCCTAGTCCAGGAATCATGGGGGTAT 

GACTGCCTCTCCAACCCTGTGGGCTGTAAGCAAGCTCAGGCTAGTCTCCCCACTGGGGGC 

TGTGCCCCTCCCTGGGACGGTTCCGTGGGCAGCCCCATCACTGTGTTCAATAGTGTGAGA 

ATGTAGCTAAAGCCCCTGCTGCTGCTGCTGCACATGCCACAGCAGGCGGTGGGGGCTGCG 

TGGGGACAATCCATCGTGGAGTGTTCTCTCAGCTTAGGTCTGGACAGGAGACTTGGCGGG 

AGATGCTCCAGGATGTGGGTGATTCTGTACCTGGGGAGGCTATCTCTGACCTCCCGACAG 

GGGACACTCCCAGGCCAGCCCAGGGGTCAGGGGCAGAGGTGCACACCTCAGCATGAGCCA 

AGACTGGGGTCAGGGAGCAGGTGTGGTTTGAGCCAGGACCTGGGGCGGGGGTGGGGCCGG 

GGCCTTTCTGCCTCATTTGCTTTCAATGAAAGCCTCAAAGCAGCCAAAACCAGGCTTTCC 

CCCTTCCTCGAGTTTGAATATCCAGAATCTTTTGTACTTCTTGTTGGTTAAATTGTTTAT 

TTTTGTAAAAAATAAAATAAAATTAGTTAATAAAATGATGTTTCACAGCAAACTCTTCCC 



SEO ID NO: 107 AA396601 M 

CCACGCGTCCGGGCTGCGCCGCGCTCCGCAACGTGTCTGGCGCGCAGTACGTGGGCTCAG 
GCTACACTAAGGCTGTGTACCGGGTCCGCCTGCCCGGCGGCGCCGCGGTGGCGCTTAAAG 
CAGTGGACTTCAGCGGCCACGATCTGGGCAGCTGCGTGCGCGAGTTCGGGGCGCGAAGGG 
GCTGCTATCGCCTGGCGGCCCACAAGCTGCTCAAAGAGATGGTGCTGCTGGAGCGGCTGC 
GGCACCCCAACGTGCTGCAGCTCTATGGCTATTGCTACCAGGACAGTGAGGGCATCCCAG 
ACACGCTGACCACCATCACAGAGCTGGGTGCCCCTGTGGAGATGATCCAGCTGTTGCAGA 
CTTCCTGGGAGGATCGATTCCGAATCTGCCTCAGCCTTGGCCGCCTCCTCCACCACCTGG 
CCCACTCCCCGCTGGGCTCGGTCACCCTGCTTGACTTCCGCCCTCGGCAGTTTGTGCTAG 
TGAACGGGGAGCTGAAAGTGACAGACCTGGATGATGCCCGCGTGGAAGAGACACCGTGCA 
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CCAGCAGTGCCGACTGCACGCTAGAGTTTCCAGCCAGGAACTTCAGCCTGCCCTGCTCGG 
CCCAGGGCTGGTGCGAGGGCATGAATGAGAAACGGAACCTCTACAATGCCTACAGGTTCT 
TCTTCACATACCTCCTGCCACACAGTGCCCCGCCTTCCCTCCGACCTCTCCTGGATAGCA 
TCGTCAATGCCACGGGAGAGCTCGCCTGGGGGGTGGATGAGACCCTGGCCCAGCTGGAGA 
CAGCGCTACACTTGTTCCGAAGTGGGCAGTACCTGCAGAACTCTACAAGCAGCAGGGCTG 
AGTACCAGCGCATCCCGGACAGTGCCATCACACAGGAGGACTATCGCTGCTGGCCATCCT 
ATCACCACGGCGGCTGCCTCCTGTCCGTGTTCAACCTGGCTGAGGCTATAGATGTCTGTG 
AGAGCCATGCTCAGTGTCGTGCCTTTGTGGTCACCAACCAGACCACCTGGACAGGTCGGA 
AGCTGGTCTTTTTTAAGACTGGATGGAACCAAGTGGTCCCTGATGCCGGCAAGACCACAT 
ATGTGAAGGCCCCTGGTTGACTGGTTGTGGGCTCAGCTGACCAGCTGGGCTTGCCTGCTG 
CAGGCGTGACTTGCATCCCACCTGGGAACCCCTGCAGACAAAAGCTAGCTCCCAGAGCAA 
CTGATGTGACCAGGACAAAACGTGCAATATGCAAAAATGTTAAAATGTGAGTTTGCCAGC 
TTCAGTCCCAGACTGGTTGGAACCCGATTGCCTCTCTGGAGCTGTAGGCTGTGAGCAGGG 
CTCAGGCTGGTCTTAACTGGGACAGTCCCGTGGGCAGCCCATTACTGCATTCATGCTTTG 
AGAATGTAGCCAGAACACTGCTGCTGCATAAGCCACCGTGGGCAGGAGCTGCCTGGGGAC 
AACCAGTCTCAGAGTGCTCTCTCAGCTCAGCTCCGCTCCAAATGGAGAGCGCGGGATGCG 
GAGATGTGAGTGAACCAGCACTGGGAAGAAGGCTCTCGGGCCTCTCCCTAGAGGTTGCTC 
CTAGGCCAGCCCCGAGGCCGTGGGCAGCAGTGCTCGCATCCATATGAGCCAAGACTAGAG 
TGGAGGAGCAGATTGCATTTGAGCCAGGACTGGGGTGGGGGTAGGGTCGGGGCCTCTCTG 
CCTCATTTGCTTTCAGTGAAAGCCAGGGAGCAGCCGCAGCCAGGCTCCTCCCACTCCTGG 
AGGCCAGGCTCCTCCCCCTCCTGGAGGCCAGGCTCCTCCCCCTCCTGGAGTTTGCGTACC 
CAGAAGCTTTTATACTTCTCGTTCATTAAATTGTTTATTTTTGTAAAAAAAAAAATTAAT 

CAATTAATAAAATGATGTTTTGTGAC 
SEQ ID NO: 108_VRK3_H 

ATGATCTCCTTCTGTCCAGACTGTGGCAAAAGTATCCAAGCGGCATTCAAATTCTGCCCC 
TACTGTGGAAATTCTTTGCCTGTAGAGGAGCATGTAGGGTCCCAGACCTTTGTCAATCCA 
CATGTGTCATCCTTCCAAGGCTCAAAGAGAGGGCTGAACTCCAGTTTTGAAACCTCTCCT 
AAGAAAGTGAAATGGTCCAGCACCGTCACCTCTCCCCGATTATCCCTCTTCTCAGATGGT 
GACAGTTCTGAGTCTGAAGATACTCTGAGTTCCTCTGAGAGATCCAAAGGCTCCGGGAGC 
AGACCCCCAACCCCCAAAAGCAGCCCTCAGAAGACCAGGAAGAGCCCTCAGGTGACCAGG 
GGTAGCCCTCAGAAGACCAGCTGTAGCCCTCAGAAGACCAGGCAGAGCCCTCAGACGCTG 
AAGCGGAGCCGAGTGACCACCTCACTTGAAGCTTTGCCCACAGGGACAGTGCTGACAGAC 
AAGAGTGGGCGACAGTGGAAGCTGAAGTCCTTCCAGACCAGGGACAACCAGGGCATTCTC 
TATGAAGCTGCACCCACCTCCACCCTCACCTGTGACTCAGGACCACAGAAGCAAAAGTTC 
TCACTCAAACTGGATGCCAAGGATGGGCGCTTGTTCAATGAGCAGAACTTCTTCCAGCGG 
GCCGCCAAGCCTCTGCAAGTCAACAAGTGGAAGAAGCTGTACTCGACCCCACTGCTGGCC 
ATCCCTACCTGCATGGGTTTCGGTGTTCACCAGGACAAATACAGGTTCTTGGTGTTACCC 
AGCCTGGGGAGGAGCCTTCAGTCGGCCCTGGATGTCAGCCCAAAGCATGTGCTGTCAGAG 
AGGTCTGTGCTGCAGGTGGCCTGCCGGCTGCTGGATGCCCTGGAGTTCCTCCATGAGAAT 
GAGTATGTTCATGGAAATGTGACAGCTGAAAATATCTTTGTGGATCCAGAGGACCAGAGT 
CAGGTGACTTTGGCAGGCTATGGCTTCGCCTTCCGCTATTGCCCAAGTGGCAAACACGTG 
GCCTACGTGGAAGGCAGCAGGAGCCCTCACGAGGGGGACCTTGAGTTCATTAGCATGGAC 
CTGCACAAGGGATGCGGGCCCTCCCGCCGCAGCGACCTCCAGAGCCTGGGCTACTGCATG 
CTGAAGTGGCTCTACGGGTTTCTGCCATGGACAAATTGCCTTCCCAACACTGAGGACATC 
ATGAAGCAAAAACAGAAGTTTGTTGATAAGCCGGGGCCCTTCGTGGGACCCTGCGGTCAC 
TGGATCAGGCCCTCAGAGACCCTGCAGAAGTACCTGAAGGTGGTGATGGCCCTCACGTAT 
GAGGAGAAGCCGCCCTACGCCATGCTGAGGAACAACCTAGAAGCTTTGCTGCAGGATCTG 
CGTGTGTCTCCATATGACCCCATTGGCCTCCCGATGGTGCCCTAG 
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SEQ ID NO: 109_S71575_M VRK3_M 

CCATCCCCACCTGTATCGGCTTTGGCATTCACCAGGACAAGTACAGGTTCCTAGTATTCC 
CCAGCCTGGGGAGGAGCCTTCAGTCAGCCCTGGATGACAACCCAAAGCATGTGGTATCAG 
AGAGATGTGTGCTTCAGGTGGCCTGCAGGCTGCTGGATGCTCTGGAGTATCTCCATGAAA 
ATGAGTATGTTCACGGGAACCTGACAGCTGAGAATGTCTTTGTGAATCCAGAGGATCTGA 
GCCAGGTGACCCTGGTGGGCTATGGCTTCACCTACCGATACTGCCCAGGTGGCAAACACG 
TGGCCTACAAAGAAGGCAGCAGGAGTCCACACGATGGGGACTTGGAGTTCATTAGCATGG 
ACCTGCACAAGGGATGCGGACCCTCCCGCCGCAGCGATCTCCAGACCTTGGGCTACTGTA 
TGCTCAAGTGGCTTTATGGGTCCCTGCCATGGACAAATTGCCTTCCCAACACCGAAAAGA 
TAACTAGGCAGAAGCAGAAGTATCTGGACAGCCCCGAGCGCCTCGTGGGACTGTGTGGCC 
GCTGGAACAAGGCCTCAGAGACCCTGCGGGAGTACCTGAAGGTGGTGATGGCCCTCAATT 
ATGAGGAGAAGCCACCCTATGCCACGCTGAGGAACAGCCTAGAAGCTCTGCTGCAGGATA 
TGCGGGTGTCACCCTATGACCCTCTGGACCTCCAGATGGTGCCTTAGATGGAATCCAGAG 
CTTCCGACTTGCAGCTTGAAGTAGAACATGAAGTAGTGTGACTGGAGGCCTGTTTGAACT 
CATAGCTCCTAAAAGAATCCCTTGAATGTGCATTCTCACCGCTCCCTTAGGACATATGAA 
TCAGCACTTGTGTTGGGGAACCTGAGTCATGTCATGTAATGTGAAACTCCTCCCTGTCTC 
AGCTCTGGCAGCTGTGGATGGAGGTAAGTGGATGCTGGCGGCGGCGGCGGCAGCAGCCAC 
TCCACTCCCTATGGCATTTCTGTGATGGCATAATAAACTGTTTTTAATC 

SEQ ID NO: 1 1 0_AA4 5427_H 

ATGGGCCACGCGCTGTGTGTCTGCTCTCGGGGAACTGTCATCATTGACAATAAGCGCTAC 
CTCTTCATCCAGAAACTGGGGGAGGGTGGGTTCAGCTATGTGGACCTAGTGGAAGGGTTA 
CATGATGGACACTTCTACGCCCTGAAGCGAATCCTGTGTCACGAGCAGCAGGACCGGGAG 
GAGGCCCAGCGAGAAGCCGACATGCATCGCCTCTTCAATCACCCCAACATCCTTCGCCTC 
GTGGCTTACTGTCTGAGGG7^ACGGGGTGCTAAGCATGAGGCCTGGCTGCTGCTACCATTC 
TTCAAGAGAGGTACGCTGTGGAATGAGATAGAAAGGCTGAAGGACAAAGGCAACTTCCTG 
ACCGAGGATCAAATCCTTTGGCTGCTGCTGGGGATCTGCAGAGGCCTTGAGGCCATTCAT 
GCCAAGGGTTATGCCCACAGAGACTTGAAGCCCACCAATATATTGCTTGGAGATGAGGGG 
CAGCCAGTTTTAATGGACTTGGGTTCCATGAATCAAGCATGCATCCATGTGGAGGGCTCC 
CGCCAGGCTCTGACCCTGCAGGACTGGGCAGCCCAGCGGTGCACCATCTCCTACCGAGCC 
CCAGAGCTCTTCTCTGTGCAGAGTCACTGTGTCATCGATGAGCGGACTGATGTCTGGTCC 
CTAGGCTGCGTGCTATATGCCATGATGTTTGGGGAAGGCCCTTATGACATGGTGTTCCAA 
AAGGGTGACAGTGTGGCCCTTGCTGTGCAGAACCAACTCAGCATCCCACAAAGCCCCAGG 
CATTCTTCAGCATTGCGGCAGCTCCTGAACTCGATGATGACCGTGGACCCGCATCAGCGT 
CCTCACATTCCTCTCCTCCTCAGTCAGCTGGAGGCGCTGCAGCCCCCAGCTCCTGGCCAA 
CATACTACCCAAATCTGA 

SEQ ID NO: 111_H05721_H 

CCCTGAGGCACCGCCCCAAGTTTGGTGTGACCGGCGGGGGACGCCGGTGGTGGCGGCAGC 
GACGGCTGCGGGGGCACCGGGCCGCGGCGCCACCATGGCGGTGCGACAGGCGCTGGGCCG 
CGGCCTGCAGCTGGGTCGAGCGCTGCTGCTGCGCTTCACGGGCAAGCCCGGCCGGGCCTA 
CGGCTTGGGGCGGCCGGGCCCGGCGGCGGGCTGTGTCCGCGGGGAGCGTCCAGGCTGGGC 
CGCAGGACCGGGCGCGGAGCCTCGCAGGGTCGGGCTCGGGCTCCCTAACCGTCTCCGCTT 
CTTCCGCCAGTCGGTGGCCGGGCTGGCGGCGCGGTTGCAGCGGCAGTTCGTGGTGCGGGC 
CTGGGGCTGCGCGGGCCCTTGCGGCCGGGCAGTCTTTCTGGCCTTCGGGCTAGGGCTGGG 
CCTCATCGAGGAAAAACAGGCGGAGAGCCGGCGGGCGGTCTCGGCCTGTCAGGAGATCCA 
GGCAATTTTTACCCAGAAAAGCAAGCCGGGGCCTGACCCGTTGGACACGAGACGCTTGCA 
GGGCTTTCGGCTGGAGGAGTATCTGATAGGGCAGTCCATTGGTAAGGGCTGCAGTGCTGC 
TGTGTATGAAGCCACCATGCCTACATTGCCCCAGAACCTGGAGGTGACAAAGAGCACCGG 
GTTGCTTCCAGGGAGAGGCCCAGGTACCAGTGCACCAGGAGAAGGGCAGGAGCGAGCTCC 
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FIGURE 2FFFF 

GGGGGCCCCTGCCTTCCCCTTGGCCATCAAGATGATGTGGAACATCTCGGCAGGTTCCTC 
CAGCGAAGCCATCTTGAACACAATGAGCCAGGAGCTGGTCCCAGCGAGCCGAGTGGCCTT 
GGCTGGGGAGTATGGAGCAGTCACTTACAGAAAATCCAAGAGAGGTCCCAAGCAACTAGC 
CCCTCACCCCAACATCATCCGGGTTCTCCGCGCCTTCACCTCTTCCGTGCCGCTGCTGCC 
AGGGGCCCTGGTCGACTACCCTGATGTGCTGCCCTCACGCCTCCACCCTGAAGGCCTGGG 
CCATGGCCGGACGCTGTTCCTCGTTATGAAGAACTATCCCTGTACCCTGCGCCAGTACCT 
TTGTGTGAACACACCCAGCCCCCGCCTCGCCGCCATGATGCTGCTGCAGCTGCTGGAAGG 
CGTGGACCATCTGGTTCAACAGGGCATCGCGCACAGAGACCTGAAATCCGACAACATCCT 
TGTGGAGCTGGACCCAGACGGCTGCCCCTGGCTGGTGATCGCAGATTTTGGCTGCTGCCT 
GGCTGATGAGAGCATCGGCCTGCAGTTGCCCTTCAGCAGCTGGTACGTGGATCGGGGCGG 
AAACGGCTGTCTGATGGCCCCAGAGGTGTCCACGGCCCGTCCTGGCCCCAGGGCAGTGAT 
TGACTACAGCAAGGCTGATGCCTGGGCAGTGGGAGCCATCGCCTATGAAATCTTCGGGCT 
TGTCAATCCCTTCTACGGCCAGGGCAAGGCCCACCTTGAAAGCCGCAGCTACCAAGAGGC 
TCAGCTACCTGCACTGCCCGAGTCAGTGCCTCCAGACGTGAGACAGTTGGTGAGGGCACT 
GCTCCAGCGAGAGGCCAGCAAGAGACCATCTGCCCGAGTAGCCGCAAATGTGCTTCATCT 
AAGCCTCTGGGGTGAACATATTCTAGCCCTGAAG7VATCTGAAGTTAGACAAGATGGTTGG 
CTGGCTCCTCCAACAATCGGCCGCCACTTTGTTGGCCAACAGGCTCACAGAGAAGTGTTG 
TGTGG7^AACAAAAATGAAGATGCTCTTTCTGGCTAACCTGGAGTGTGAAACGCTCTGCCA 
GGCAGCCCTCCTCCTCTGCTCATGGAGGGCAGCCCTGTGATGTCCCTGCATGGAGCTGGT 
GAATTACTAAAAGAACATGGCATCCTCTGTGTCGTGATGGTCTGTGAATGGTGAGGGTGG 
GAGTCAGGAGACAAGACAGCGCAGAGAGGGCTGGTTAGCCGGAAAAGGCCTCGGGCTTGG 
CAAATGGAAGAACTTGAGTGAGAGTTCAGTCTGCAGTCCTCTGCTCACAGACATCTGAAA 
AGTGAATGGCCAAGCTGGTCTAGTAGATGAGGCTGGACTGAGGAGGGGTAGGCCTGCATC 
CACAGAGAGGATCCAGGCCAAGGCACTGGCTGTCAGTGGCAGAGTTTGGCTGTGACCTTT 
GCCCCTAACACGAGGAACTCGTTTGAAGGGGGCAGCGTAGCATGTCTGATTTGCCACCTG 
GATGAAGGCAGACATCAACATGGGTCAGCACGTTCAGTTACGGGAGTGGGAAATTACATG 
AGGCCTGGGCCTCTGCGTTCCCAAGCTGTGCGTTCTGGACCAGCTACTGAATTATTAATC 
TCACTTAGCGAAAGTGACGGATGAGCAGTAAGTAAGTAAGTGTGGGGATTTAAACTTGAG 
GGTTTCCCTCCTGACTAGCCTCTCTTACAGGAATTGTGAAATATTAAATGCAAATTTACA 
ACTGCAGATGACGTATGTGCCTTGAACTGAATATTTGGCTTTAAGAATGATTCTTCTTAT 
ACTCTGAAGGTGAGAATATTTTGTGGGCAGGTATCAACATTGGGGAAGAGATTTCATGTC 
TAACTAACTAACTTTATACATGATTTTTAGGAAGCTATTGCCTAAATCAGCGTCAACATG 
CAGTAAAGGTTGTCTTCAACTGACAAAA 

SEQ ID NO: 112_AI 0868 65_H 

AATGAGATGGAGAAGTACGAGCGGATCCGAGTGGTGGGGAGAGGTGCCTTCGGGATTGTG 
CACCTGTGCCTGCGAAAGGCTGACCAGAAGCTGGTGATCATCAAGCAGATTCCAGTGGAA 
CAGATGACCAAGGAAGAGCGGCAGGCAGCCCAGAATGAGTGCCAGGTCCTCAAGCTGCTC 
AACCACCCCAATGTCATTGAGTACTACGAGAACTTCCTGGAAGACAAAGCCCTTATGATC 
GCCATGGAATATGCACCAGGCGGCACTCTGGCTGAGTTCATCCAAAAGCGCTGTAATTCC 
CTGCTGGAGGAGGAGACCATCCTGCACTTCTTCGTGCAGATCCTGCTTGCACTGCATCAT 
GTGCACACCCACCTCATCCTGCACCGAGACCTCAAGACCCAGAACATCCTGCTTGACAAA 
CACCGCATGGTCGTCAAGATCGGTGATTTCGGCATCTCCAAGATCCTTAGCAGCAAGAGC 
ACCCCATGCTATATCTCCCCTGAGCTGTGTGAGGGCAAGCCCTACAACCAGAAGAGTGAC 
ATCTGGGCCCTGGGCTGTGTCCTCTACGAGCTGGCCAGCCTCAAGAGGGCTTTCGAGGCT 
GCGAACTTGCCAGCACTGGTGCTGAAGATCATGAGTGGCACCTTTGCACCTATCTCTGAC 
CGGTACAGCCCTGAGCTTCGCCAGCTGGTCCTGAGTCTACTCAGCCTGGAGCCTGCCCAG 
CGGCCACCACTCAGCCACATCATGGCACAGCCCCTCTGCATCCGTGCCCTCCTCAACCTC 
CACACCGACGGCAGAGAAGTCCGTGGCCCCCAGCAACACAGGGAGCAGGACCACCAGTGT 
CCGCTGCAGAGAGGCATCATCATGACATTCGGCAGCGGCAGCAATGGGTGCCTAGGCCAT 
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FIGURE 2GGGG 

GGCAGCCTCACTGACATCAGCCAGCCCACCATTGTGGAGGCTTTGTTGGGCTATGAAATG 

GTGCAGCAAGTGGAGGAGGCCCTGAGCTTCACACTACTAGGCTCTGCACCCCTGGACCAG 

GAGCCTCTGCTGAGTATAGACCTGGGCACTGCTCACTCAGCTGCTGTGACTGGTGAGGAG 

GACTTGGGCTCTGGAGATGTAAACAGGTTACCCAGCTGGGAGAGAGGACATCTGCTGGCT 

GGTGTGGCGTCCAGCACTGATGTGTCTACCTTCTCTGAAGGTGACTGCAAGGAGCCTGAC 

AAGTGCTGCTGGAGACACAAGCAGTGCACTGGGCACATCATCTACCCTTTCGCCTCTGAC 

TGTGTCCGCCACAGCCTGCACCTACACTCTGTCAACCACTGCAACTGTAATTCTAGGCTG 

AAGGACTCTTCAGAGGATAGCAGCAGCTCCCGGGGCGCGGGCCCAACCTGCTCCCATGTC 

ATCGAGTCCCCTTGCTTTGAGCTCACACCGGAGGAGGAGCATGTGGAGCGATTCCGGTAT 

GGCTGGTGCAAAAGCTACAGACCTGTCTCTGTGGCAGTGATCCACCATCCACTCTACCAT 

CAGTGTGGGGCAGATGATCTAAATGXXAAGAAGAGGAAGAGGAGGAGGAGGAAAAGCAAG 

CCCCCCATCCCGACACAGGTGGGGCCCGCCACCGCCTCCCCTGACCTAGGCACCAGCATG 

GCCACTGGTACCCCTGACTCCACAGCGCCCATCACCATCTGGCGCTCTGAGAGCCCCACA 

GGGAAGGGTCAGGGCAGCAAGGTGATCAAGAAGGTAAAGAAGAAAAAGGAAAAAGAGAAA 

GACAAGGAGGAGATGGATGAGAAGGCAAAGCTGAAGAAAAAAGCCAAGAAAGGCCAGTTG 

ACTAAGAAGAAAAGCCCGGTTAAATTGGAGCCTTCCCCGCCAGACGTGAGCCGATCATTA 

AGCGCAAGACAGCTGGCCAGGATGTCCGAGTCCAGCCCAGAAAGCCGGGAAGAGCTGGAG 

AGCGAGGACAGTTACAATGGCCGGGGGCAGGGAGAACTGTCCAGCGAGGATATTGTGGAA 

TCATCATCGCCCAGGAAGAGAGAGAACACAGTCCAGGCCAAAAAGACAGGGGCAAAGCCC 

TCACAAGCCAGGAAGGTAAACAAGAGAAAATCTCCCCCAGGATCAAACCCCAACCTCAGT 

TGAGGCCAGGGTGGTCAGGGTGCAGAATAAATGCCATCGAGCCTGTGGCTGGCCCTCTGC 

TGCTGTTCTCTCCCTCCAACCTGGCTGTTTCTTGCGGGGCAAGGGGTGGGCTCAGGGCTG 

CAGGGGTTTCTCAAAGGCAATCCAGCTTTCACAAAGGAAGCCCATGGGAAGGCAGGTGGG 

AGGGAAAGGAAGGGGCACAGCCCTATTTCTTCCTACCTGCTAGGACAAGGTGGAAGAGTG 

TATCTGGGGTGGGAAGGAGGGCTTCCCCTCTCTGCTGCGAGAGACTGGTCTGTGTGAAAT 

CCACTTCTGGGACAGGCAGTACTGTCTGCAGCGATACCCCCAATAAACGGAACTTTTTAA 

ccc 

SEO ID NO: 113 AA836348_H 

ATGTCGGTGCTGGGCGAGTACGAGCGACACTGCGATTCCATCAACTCGGACTTTGGGAGC 

GAGTCCGGGGGTTGCGGGGACTCGAGTCCGGGGCCTAGCGCCAGTCAGGGGCCGCGAGCC 

GGCGGCGGCGCGGCGGAGCAGGAGGAACTGCACTACATCCCCATCCGCGTCCTGGGCCGC 

GGCGCCTTCGGGGAAGCCACGCTGTACCGCCGCACCGAGGATGACTCACTGGTTGTGTGG 

AAGGAAGTCGATTTGACCCGGCTGTCTGAGAAGGAACGTCGTGATGCCTTGAATGAGATA 

GTTATTCTGGCACTGCTGCAGCACGACAACATTATTGCCTACTACAATCACTTCATGGAC 

AATACCACGCTGCTGATTGAGCTGGAATATTGTAATGGAGGGAACCTGTATGACAAAATC 

CTTCGTCAGAAGGACAAGTTGTTTGAGGAAGAGATGGTGGTGTGGTACCTATTTCAGATT 

GTTTCAGCAGTGAGCTGCATCCATAAAGCTGGAATCCTTCATAGAGATATAAAGACATTA 

AATATTTTTCTGACCAAGGCAAACCTGATAAAACTTGGAGATTATGGCCTAGCAAAGAAA 

CTTAATTCTGAGTATTCCATGGCTGAGACGCTTGTGGGAACCCCATATTACATGTCTCCA 

GAGCTCTGTCAAGGAGTAAAGTACAATTTCAAGTCTGATATCTGGGCAGTTGGCTGCGTC 

ATTTTTGAACTGCTTACCTTAAAGAGGACGTTTGATGCTACAAACCCACTTAACCTGTGT 

GTGAAGATCGTGCAAGGAATTCGGGCCATGGAAGTTGACTCTAGCCAGTACTCTTTGGAA 

TTGATCCAAATGGTTCATTCGTGCCTTGACCAGGATCCTGAGCAGAGACCTACTGCAGAT 

rAACTTCTAGATCGCCCTCTTCTCAGGAAACGCAGGAGGTCAAGCACTGTGACTGAAGCA 

CCCATTGCTGTAGTAACATCACGAACCAGTGAAGTCTATGTTTGGGGTGGTGGAAAATCC 

ACCCCCCAGAAACTGGATGTTATCAAGAGTGGCTGTAGTGCCCGGCAGGTCTGTGCAGGG 

AATACCCACTTTGCTGTGGTCACAGTGGAGAAGGAACTGTACACTTGGGTGAACATGCAA 

GGAGGCACTAAACTCCATGGTCAGCTGGGCCATGGAGACAAAGCCTCCTATCGACAGCCA 

AAGCATGTGGAAAAGTTGCAAGGCAAAGCTATCCGTCAGGTGTCATGTGGTGATGATTTC 
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ACTGTCTGTGTGACTGATGAGGGTCAGCTCTATGCCTTCGGATCAGATTATTATGGCTGC 
ATGGGGGTGGACAAAGTTGCTGGCCCTGAAGTGCTAGAACCCATGCAGCTGAACTTCTTC 
CTCAGCAATCCAGTGGAGCAGGTCTCCTGTGGAGATAATCATGTGGTGGTTCTGACACGA 
AACAAGGAAGTCTATTCTTGGGGCTGTGGCGAATATGGACGACTGGGTTTGGATTCAGAA 
GAGGATTATTATACACCACAAAAGGTGGATGTTCCCAAGGCCTTGATTATTGTTGCAGTT 
CAATGTGGCTGTGATGGGACATTTCTGTTGACCCAGTCAGGCAAAGTGCTGGCCTGTGGA 
CTCAATGAATTCAATAAGCTGGGTCTGAATCAGTGCATGTCGGGAATTATCAACCATGAA 
GCATACCATGAAGTTCCCTACACAACGTCCTTTACCTTGGCCAAACAGTTGTCCTTTTAT 
AAGATCCGTACCATTGCCCCAGGCAAGACTCACACAGCTGCTATTGATGAGCGAGGCCGG 
CTGCTGACCTTTGGCTGCAACAAGTGTGGGCAGCTGGGCGTTGGGAACTACAAGAAGCGT 
CTGGGAATCAACCTGTTGGGGGGACCCCTTGGTGGGAAGCAAGTGATCAGGGTCTCCTGC 
GGTGATGAGTTTACCATTGCTGCCACTGATGAGAAAGTATTGAATTCTAAGACCATCCGT 
TCCAATAGCAGTGGCTTATCCATTGGAACTGTGTTTCAGAGCTCTAGCCCGGGAGGAGGC 
GGCGGGGGCGGCGGTGGTGAAGAAGAGGACAGTCAGCAGGAATCTGAAACTCCTGACCCA 
AGTGGAGGCTTCCGAGGAACAATGGAAGCAGACCGAGGAATGGAAGGTTTAATCAGTCCC 
ACAGAGGCCATGGGGAACAGTAATGGGGCCAGCAGCTCCTGTCCTGGCTGGCTTCGAAAG 
GAGCTGGAAAATGCAGAATTTATCCCCATGCCTGACAGCCCATCTCCTCTCAGTGCAGCG 
TTTTCAGAATCTGAGAAAGATACCCTGCCCTATGAAGAGCTGCAAGGACTCAAAGTGGCC 
TCTGAAGCTCCTTTGGAACAC7^AACCCCAAGTAGAAGCCTCGGTAACTGAGCTTTTTGCC 
TTTGAATCACAACTAGTCACCTCGGCTGAATCCTGCAGTAACCTGTGCTGGGAAGGGAAC 
ACCACTGACTCCTCCTGCGTGTGCGTGCAGCTCTCTGCAGGTGGAGGTTGA 

SEQ ID NO: 114_R86668_H , MKK6_H 

ATGAACTTGCTGCTCTCCTACCGCGATGTGCAGGACTACTCGGCCATCATTGAGCTGGTG 
GAGACGCTGCAGGCCTTGCCCACCTGTGATGTGGCCGAGCAGCATAATGTCTGCTTCCAC 
TACACTTTTGCCCTCAACCGGAGGAACAGGCCTGGGGACCGGGCGAAGGCCCTGTCTGTG 
CTGCTGCCGCTGGTACAGCTTGAGGGCTCTGTGGCGCCCGATCTGTACTGCATGTGTGGC 
CGTATCTAC7VAGGACATGTTCTTCAGCTCGGGTTTCCAGGATGCTGGGCACCGGGAGCAG 
GCCTATCACTGGTATCGCAAGGCTTTTGACGTAGAGCCCAGCCTTCACTCAGGCATCAAT 
GCAGCTGTGCTCCTCATTGCTGCCGGGCAGCACTTTGAGGATTCCAAAGAGCTCCGGCTA 
ATAGGCATGAAGCTGGGCTGCCTGCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTAT 
TACTGGGATGTGGGTTTCTACCTGGGAGCCCAGATCCTCGCCAATGACCCCACCCAGGTG 
GTGCTGGCTGCAGAGCAGCTGTATAAGCTCAATGCCCCCATATGGTACCTGGTGTCCGTG 
ATGGAGACCTTCCTGCTCTACCAGCACTTCAGGCCCACGCCAGAGCCCCCTGGAGGGCCA 
CCACGCCGTGCCCACTTCTGGCTCCACTTCTTGCTACAGTCCTGCCAACCATTCAAGACA 
GCCTGTGCCCAGGGCGACCAGTGCTTGGTGCTGGTCCTGGAGATGAACAAGGTGCTGCTG 
CCTGCAAAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCTGAGCCTGCTG 
GAGCCTGAGACCCAGGACATTCCCTCCAGCTGGACCTTCCCAGTCGCCTCCATATGCGGA 
GTCAGCGCCTCAAAGCGCGACGAGCGCTGCTGCTTCCTCTATGCACTCCCCCCGGCTCAG 
GACGTCCAGCTGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTTCTGCGGCCTGATCCAG 
GCCTGGGTGACGAACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAG 
ATGTTGGAGTTTGATTATGAGTACACGGAGACGGGCGAGCGGCTGGTGCTGGGCAAGGGC 
ACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCACACGAGGGTGCGCATCGCCATCAAG 
GAGATCCCGGAGCGGGACAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGCTCTTCAC 
AGACGCCTGCGCCACAAGAACATAGTGCGCTATCTGGGCTCAGCTAGCCAGGGCGGCTAC 
CTTAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTGCGGTCGGTG 
TGGGGACCCCTGAAGGACAACGAGAGCACCATCAGTTTCTACACCCGCCAGATCCTGCAG 
GGACTTGGCTACTTGCACGACAACCACATCGTGCACAGGGACATAAAAGGGGACAATGTG 
CTGATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTCGGCACCTCCAAGCGGCTG 
GCAGGCATCACACCTTGCACTGAGACCTTCACAGGAACTCTGCAGTATATGGCCCCAGAA 
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FIGURE 2IIU 

ATCATTGACCAGGGCCCACGCGGGTATGGGAAAGCAGCTGACATCTGGTCACTGGGCTGC 
ACTGTCATTGAGATGGCCACAGGTCGCCCCCCCTTCCACGAGCTCGGGAGCCCACAGGCT 
GCCATGTTTCAGGTGGGTATGTACAAGGTCCATCCGCCAATGCCCAGCTCTCTGTCGGCC 
GAGGCCCAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCC 
CAGACACTGCTGGGGGACCCCTTCCTGCAGCCTGGGAAAAGGAGCCGCAGCCCCAGCTCC 
CCACGACATGCTCCACGGCCCTCAGATGCCCCTTCTGCCAGTCCCACTCCTTCAGCCAAC 
TCAACCACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCTCAGCACCCACCCAGC 
CCCCCGAAGCGCTGCCTCAGTTATGGGGGCACCAGCCAGCTCCGGGTGCCCGAGGAGCCT 
GCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTTCGGGGCTGAGCCTGCTGCACCAGGAG 
AGCAAGCGTCGGGCCATGCTGGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAG 
AATCTGCACCAGGAGCAGAAGCAAGAGCAGGGGGCCCGTCTGGGCAGAAACCATGTGGAA 
GAGCTGCTGCGCTGCCTCGGGGCACACATCCACACTCCCAACCGCCGGCAGCTCGCCCAG 
GAGCTGCGGGCGCTGCAAGGACGGCTGAGGGCCCAGGGCCTTGGGCCTGCGCTTCTGCAC 
AGACCGCTGTTTGCCTTCCCGGATGCGGTGAAGCAGATCCTCCGCAAGCGCCAGATCCGT 
CCACACTGGATGTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGCGGGCAGCCCTGGGT 
GTGCTAGGACCGGAGGTGGAGAAGGAGGCGGTCTCACCGAGGTCAGAGGAGCTGAGTAAT 
GAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCGCTTCCGGTGGAGCCCGAGCAG 
GGCCCCGCTCCTCTGATGGTGCAGCTGAGCCTCTTGAGGGCAGAGACTGATCGGCTGCGC 
GAAATCCTGGCGGGGAAGGAACGGGAGTACCAGGCCCTGGTGCAGCGGGCTCTACAGCGG 
CTGAATGAGGAAGCCCGGACCTATGTCCTGGCCCCAGAGCCTCCAACTGCTCTTTCAACG 
GACCAGGGCCTGGTGCAGTGGCTACAGGAACTGAATGTGGATTCAGGCACCATCCAAATG 
CTGTTGAACCATAGCTTCACCCTCCACACTCTGCTCACCTATGCCACTCGAGATGACCTC 
ATCTACACCCGCATCAGGGGAGGGATGGTATGCCGCATCTGGAGGGCCATCTTGGCACAG 
CGAGCAGGATCCACACCAGTCACCTCTGGACCCTGA 

SEQ ID NO: 115_PAK6_H 

ATGTTTGGGAAGAAAAAGAAAAAGATTGAAATATCTGGCCCGTCCAACTTTGAACACAGG 
GTTCATACTGGGTTTGATCCACAAGAGCAGAAGTTTACCGGCCTTCCCCAGCAGTGGCAC 
AGCCTGTTAGCAGATACGGCCAACAGGCCAAAGCCTATGGTGGACCCTTCATGCATCACA 
CCCATCCAGCTGGCTCCTATGAAGACAATCGTTAGAGGAAACAAACCCTGCAAGGAAACC 
TCCATCAACGGCCTGCTAGAGGATTTTGACAACATCTCGGTGACTCGCTCCAACTCCCTA 
AGGAAAGAAAGCCCACCCACCCCAGATCAGGGAGCCTCCAGCCACGGTCCAGGCCACGCG 
GAAGAAAATGGCTTCATCACCTTCTCCCAGTATTCCAGCGAATCCGATACTACTGCTGAC 
TACACGACCGAAAAGTACAGGGAGAAGAGTCTCTATGGAGATGATCTGGATCCGTATTAT 
AGAGGCAGCCACGCAGCCAAGCAAAATGGGCACGTAATGAAAATGAAGCACGGGGAGGCC 
TACTATTCTGAGGTGAAGCCTTTGAAATCCGATTTTGCCAGATTTTCTGCCGATTATCAC 
TCACATTTGGACTCACTGAGCAAACCAAGTGAATACAGTGACCTCAAGTGGGAGTATCAG 
AGAGCCTCGAGTAGCTCCCCTCTGGATTATTCATTCCAATTCACACCTTCTAGAACTGCA 
GGGACCAGCGGGTGCTCCAAGGAGAGCCTGGCGTACAGTGAAAGTGAATGGGGACCCAGC 
CTGGATGACTATGACAGGAGGCCAAAGTCTTCGTACCTGAATCAGACAAGCCCTCAGCCC 
ACCATGCGGCAGAGGTCCAGGTCAGGCTCGGGACTCCAGGAACCGATGATGCCATTTGGA 
GCAAGTGCATTTAAAACCCATCCCCAAGGACACTCCTACAACTCCTACACCTACCCTCGC 
TTGTCCGAGCCCACAATGTGCATTCCAAAGGTGGATTACGATCGAGCACAGATGGTCCTC 
AGCCCTCCACTGTCAGGGTCTGACACCTACCCCAGGGGCCCTGCCAAACTACCTCAAAGT 
CAAAGCAAATCGGGCTATTCCTCAAGCAGTCACCAGTACCCGTCTGGGTACCACAAAGCC 
ACCTTGTACCATCACCCCTCCCTGCAGAGCAGTTCGCAGTACATCTCCACGGCTTCCTAC 
CTGAGCTCCCTCAGCCTCTCATCCAGCACCTACCCGCCGCCCAGCTGGGGCTCCTCCTCC 
GACCAGCAGCCCTCCAGGGTGTCCCATGAACAGTTTCGGGCGGCCCTGCAGCTGGTGGTC 
AGCCCAGGAGACCCCAGGGAATACTTGGCCAACTTTATCAAAATCGGGGAAGGCTCAACC 
GGCATCGTATGCATCGCCACCGAGAAACACACAGGGAAACAAGTTGCAGTGAAGAAAATG 
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CGGATAAAGTTGTCTGATTTTGGTTTCTGTGCT^ 

qPO ID NO: 116 SURTK106_H _„„ rsl A <-"rr CAAAGTCTTACCATTATAGCA 
CTGGAATGCAGTCTCAGTGACAAGTTGTGT^ 

ctSSctctatatggtgttggaggatgtggccca^ 

CTCTCRGTCTCACTTCTGCTCTCCCASTCCTA^T ^ TCGCAC A^TAAGCTCATC 
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FIGURE 2KKKK 

GAACATTCAACATGTATTGTTCATTAAGCTAGCTTCCTAGTTCCGATTAGACTAAGGAGA 
CTAAGCCTAGAGAGTCAATGTTAGAACAGTGAAAAGAATTCTGTGTGTGTGTGTGTGTGT 
GTGTGTGTGTGCACAATAAATAGGAAATGTAGAAACCAAGCAAGAAGGCTTAGTAGCTCA 
GTCTTTAACAAGGGCTAGAAAAGAATGTAATCTGATATGGAAGGATAGCAGCTTCTAATT 
TTCAATCATCTGTTGATATACTGTGAAACTTATTTTATTAAATTAATATTTATTAAATGG 

SEQ ID NO: 117_AA098024_M 

CTGCAGGAGAAGCACCTGTTTCATGGGGATGTGGCTGCCAGGAACATCCTGATCCAAAGT 
GACCTGACTCCCAAACTTTGTCATCTGGGCCTGGCTTATGAAGTTCATGCCCATGGGGCC 
ATCTCCTCTGCTCGATCCAGCACCATCCCTCTCAAGTGGCTTGCTCCAGAAAGGCTTCTC 
CTGAGACCTGCAAGCATCAGGGGAGATATTTGGTCCTTTGGGATCCTGCTTTATGAGATG 
GTGACTCTAGGAGCACCACCATACCCTGAAGTCCCTCCCACCAGCATCCTACAATATCTT 
CAGAGAAAGAAAATCATGAAGAGACCCAGCAGCTGCTCACATGCCATGTACAACATCATG 
AAGTGCTGTTGGCGCTGGAGTGAGGACAGCCGCCCCTTACTTGTTCAGCTGCTCCAGCGC 
CTAGAAGCTGCTTCTAGATCTGCCGATGACAAGGCTGTGTTGCAAGTGCCAGAGTTGGTG 
GTGCCTGAACTGTATGCAGATGTGGCTGGCATCAGGGCAGAAAGCATTTCCTATAGCTTC 
AGTGTCCTTTGAAGATGGTCCTAGACAAATGACTATATATGGGTGGAATTAGTTCCTTCA 
AGAACAGAGAGAAGGAACTTTCTGTGGCCCACCAAGGGAGAAAAAAGGACATGGATCTTG 
CATCTTTCCCTAAACATTTTCCTAGACATCTGAAATGCTGCTGGATGAAGCTCTACCTCT 
ACATACCATGTACTCTTGAGCTAAGAATCACCATCAATTGTAGTTTGCTTTCCAGTCCCA 
AGGGCTGAAGTATAAGTGGTGGACCGTGTCATTCTAAAGGAGGTTTTTAAAATCTGCAAT 
GATTGTAAGGGAATTAGGCAAAAGGGCTGGTCCCACTCACTCCAGGCTGGTTTACTACTG 
AAACTAGTTTTTCTTTTCTTTTTTTTTAAGTTAAACTATTACAGAGTAAAAATAAACCAG 
ATGGGCATGAATGAACACCTTCTAATTTTTAACCATGAATTGAATATTGGAATTCATGAG 
AAAGAAAATTCTAGGTTCTTTTTGCTAAGAGGTGTTAAGGTGAGTCAATATATCCTTCAA 
GGAAAGGCTTTGTCTCATCTATGTTGACGGGACGTAAAAGTCCTCGTCCCGTTATGAAGA 
GCTGAAGAAGATCTATAAGAAACAATACTGAGCCTTTCCTTGACTATAGATAGAAGAGCA 
TCCTTTCATTGAACTCTGAGGCAGGTGGACCATGCATGATACTAAGTTTAATTAGAAGCA 
GGAGGAGTCATTTCATGATTAGGAACATTGTTCATCCCATTTGTTTGCCAGTTCCTGTAA 
GACTAAGGAGAATCAGCCTATAGAGCCAAAGCTAGAACCAGGGATAAAAAGTGTGTGTGT 
GTATAACAAATAGGAAGCATGAAAGTCGAGCAAGAAGACTTAGTAACCCAGGTGGTCATT 
AAGAGGTACAGAGAAGAAGTAATCTTATAGGAATGGATGGTAGCTTCTAATTTTTAACCA 
TTCATTGAAATAACTGTGAAGCAACTCATTAAACTAGTATTTATTGACCAAAAGTAGACT 
TTTCAGGTGTATAGCTGCCAAAATCTCTATAATAAAGAGGCTAAAAGAAAATAAATGGGA 
GTTATTTTACTAGGAAAATTAGAGAACCTATAGTTTCCAAAAAGAGATTCTTTATGTGCA 
AAATGAGATAACTCTCTACCTCACAGGGTTGGTGTGAGGAACAATGAGAATATGTATTTG 
TGTATTATGTAGAATATAATATATTCTCAATAAATACTAGTTTTTCCCCTTTC 

SEQ ID NO: 118_SGK2ALPHA_H 

GAAGAGGGCAGAGCCGTGCATGGGGCTGCTCCCCAGGACCTGAGCAGGAACCTGGAGTTT 
TCAGAGCTGCCTGATCATTGCTACAGAATGAACTCTAGCCCAGCTGGGACCCCAAGTCCA 
CAGCCCTCCAGGGCCAATGGGAACATCAACCTGGGGCCTTCAGCCAACCCAAATGCCCAG 
CCCACGGACTTCGACTTCCTCAAAGTCATCGGCAAAGGGAACTACGGGAAGGTCCTACTG 
GCCAAGCGCAAGTCTGATGGGGCGTTCTATGCAGTGAAGGTACTACAGAAAAAGTCCATC 
TTAAAGAAGAAAGAGCAGAGCCACATCATGGCAGAGCGCAGTGTGCTTCTGAAGAACGTG 
CGGCACCCCTTCCTCGTGGGCCTGCGCTACTCCTTCCAGACACCTGAGAAGCTCTACTTC 
GTGCTCGACTATGTCAACGGGGGAGAGCTCTTCTTCCACCTGCAGCGGGAGCGCCGGTTC 
CTGGAGCCCCGGGCCAGGTTCTACGCTGCTGAGGTGGCCAGCGCCATTGGCTACCTGCAC 
TCCCTCAACATCATTTACAGGGATCTGAAACCAGAGAACATTCTCTTGGACTGCCAGGGA 
CACGTGGTGCTGACGGATTTTGGCCTCTGCAAGGAAGGTGTAGAGCCTGAAGACACCACA 
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TCCACATTCTGTGGTACCCCTGAGTACTTGGCACCTG7^AGTGCTTCGGAAAGAGCCTTAT 
GATCGAGCAGTGGACTGGTGGTGCTTGGGGGCAGTCCTCTACGAGATGCTCCATGGCCTG 
CCGCCCTTCTACAGCCAAGATGTATCCCAGATGTATGAGAACATTCTGCACCAGCCGCTA 
CAGATCCCCGGAGGCCGGACAGTGGCCGCCTGTGACCTCCTGCAAAGCCTTCTCCACAAG 
GACCAGAGGCAGCGGCTGGGCTCCAAAGCAGACTTTCTTGAGATTAAGAACCATGTATTC 
TTCAGCCCCATAAACTGGGATGACCTGTACCACAAGAGGCTAACTCCACCCTTCAACCCA 
AATGTGACAGGACCTGCTGACTTGAAGCATTTTGACCCAGAGTTCACCCAGGAAGCTGTG 
TCCAAGTCCATTGGCTGTACCCCTGACACTGTGGCCAGCAGCTCTGGGGCCTCAAGTGCA 
TTCCTGGGATTTTCTTATGCGCCAGAGGATGATGACATCTTGGATTGCTAGAAGAGAAGG 
ACCTGTGAAACTACTGAGGCCAGCTGGTATTAGTAAGGAATTACCTTCAGCTGCTAGGAA 
GAGCGACTCAAACTAACAATGGCTTCAACGAGAAGCAGGTTTATTTTTTCCAGCACATAA 
AAGAAAAATAATGTTTCGGAGTCCAGGACTGGCAGGACAGGTCATCAGATACTCAGAGGC 
TGTATCTCTGCCCTGCCAACCTTGACAAATGGCTTCCAATGTTAGGTTTGCTACAAGATG 
GTTACTGGAGCTCTAGCTGCCTATTTTGTGTTTAGGGAAGGGAAAATGGAGGAAAGGGGA 
GAAGAGCAAAGGGCGCTTTTAAAGAGCTTTCCCAAAAGCTCCCCCCAATGACTTTTGCTT 
CCATCTCACTAACCACCCACCCCTACCTGGAATGGAGGCTGGGAAATGTGGCTTATTTGC 
TGGGTACGTGACTATCCCTAATAACAAAGGGGTTTTGACCCTAAGACATTAGGGGAGAAT 
GTTGGGTAGGCAGCCAGCCCTCTTTTACCATAGGGCCTCCTGGTGTTTGGATTTTGATCT 
CAATGTGTAAAATGACAGAGATGTAACAAGCTCATAGGGTATCAATATCTCTTATTGTTC 
TATGTTGAAAAA 

SEQ ID NO: 12 0_CCRK_H 

ATGGACCAGTACTGCATCCTGGGCCGCATCGGGGAGGGCGCCCACGGCATCGTCTTCAAG 
GCCAAGCACGTGGAGACTGGCGAGATAATTGCCCTCAAGAAGGTGGCCCTAAGGCGGTTG 
GAAGACGGCTTCCCTAACCAGGCCCTGCGGGAGATTAAGGCTCTGCAGGAGATGGAGGAC 
AATCAGTATGTGGTACAACTGAAGGCTGTGTTCCCACACGGTGGAGGCTTTGTGCTGGCC 
TTTGAGTTCATGCTGTCGGATCTGGCCGAGGTGGTGCGCCATGCCCAGAGGCCACTAGCC 
CAGGCACAGGTCAAGAGCTACCTGCAGATGCTGCTCAAGGGTGTCGCCTTCTGCCATGCC 
AAC71ACATTGTACATCGGGACCTGAAACCTGCCAACCTGCTCATCAGCGCCTCAGGCCAG 
CTCAAGATAGCGGACTTTGGCCTGGCTCGAGTCTTTTCCCCAGACGGCAGCCGCCTCTAC 
ACACACCAGGTGGCCACCAGGTCTGTGGGCTGCATCATGGGGGAGCTGTTGAATGGGTCC 
CCCCTTTTCCCGGGCAAGAACGATATTGAACAGCTTTGCTATGTGCTTCGCATCTTGGGC 
ACCCCAAACCCTCAAGTCTGGCCGGAGCTCACTGAGCTGCCGGACTAC7^ACAAGATCTCC 
TTTAAGGAGCAGGTGCCCATGCCCCTGGAGGAGGTGCTGCCTGACGTCTCTCCCCAGGCA 
TTGGATCTGCTGGGTCAATTCCTTCTCTACCCTCCTCACCAGCGCATCGCAGCTTCCAAG 
GCTCTCCTCCATCAGTACTTCTTCACAGCTCCCCTGCCTGCCCATCCATCTGAGCTGCCG 
ATTCCTCAGCGTCTAGGGGGACCTGCCCCCAAGGCCCATCCAGGGCCCCCCCACATCCAT 
GACTTCCACGTGGACCGGCCTCTTGAGGGAGTCGCTGTTGAACCCAGAGCTGATTCGGCC 
CTTCATCCTGGAGGGGTGAGAAGTTGGCCCTGGTCCCGTCTGCCTGCTCCTCAGGACCAC 
TCAGTCCACCTGTTCCTCTGCCACCTGCCTGGCTTCACCCTCCAAGGCCTCCCCATGGCC 
ACAGTGGGCCCACACCACACCTTGCCCCTTAGCCCTTGCGAGGGTTGGTCTCGAGGCAGA 
GGTCATGTTCCCAGCCAAGAGTATGAGAACATCCAGTCGAGCAGAGGAGATTCATGGCCT 
GTGCTCGGTGAGCCTTACCTTCTGTGTGCTACTGACGTACCCATCAGGACAGTGAGCTCT 
GCTGCCAGTCAAGGCCTGCATATGCAGAATGACGATGCCTGCCTTGGTGCTGCTTCCCCC 
GAGTGCTGCCTCCTGGTCAAGGAGAAGTGCAGAGAGTAA 

SEQ ID NO: 121_TESK2_H 

GAATTCGCGGCCGCTCGACGCTCAGCAGAGCTACCAGCTGCCCTGTTGGCTTCGCTGGTC 
GGATCGTCCTCCTGGCCCCGCCAAACAGGCGAGCGGCCCCGACTGTGGGGCATGGCAGTA 
GTCTCCTCGTTCTCCGCCGCCGCTAGCCTAGCTGAGTCGCCGGCTTCTGCGCTAGGGGCT 
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CCCACCGCCTCCGCAGGCTAAGGAGCCGCTGCCACCAACGAGCTGTGAGGGTTACTATGC 

TCCCTCTTTGCCGCCGTCTCCTCCTCTTGCCCGCGCAGGCACCCCTCTGGCTGCTCAGTC 

CTGCCTCAGTGTCAAACCAGAAGAGAAGTAAAATTCAACAAAAATTTATGTGTGGAGTTC 

CTTCTTAAAAGAAGAAAAAAGTGATTATTTAGACTATGGATCGGAGCAAACGGAATTCAA 

TTGCAGGATTTCCTCCACGTGTGGAGCGTCTTGAAGAGTTTGAAGGAGGTGGTGGAGGAG 

AAGGAAATGTGAGCCAGGTGGGAAGAGTTTGGCCATCTTCGTATCGAGCTCTTATAAGTG 

CCTTTTCCAGACTGACGCGTTTGGATGATTTCACCTGTGAAAAAATAGGGTCTGGCTTCT 

TTTCTGAAGTGTTCAAGGTACGACACCGAGCTTCTGGTCAGGTGATGGCTCTTAAGATGA 

ACACATTGAGCAGTAACCGGGCAAACATGCTGAAAGAAGTACAGCTCATGAATAGACTCT 

CCCATCCCAACATCCTTAGGTATATCAACTCCGGGAACCTGGAACAGTTGCTAGACAGTA 

ACCTGCATTTGCCTTGGACTGTGAGGGTAAAACTGGCCTATGACATAGCAGTGGGCCTCA 

GCTACCTTCACTTCAAAGGCATTTTTCATCGGGACCTCACATCTAAGAACTGCCTGATAA 

AGAGGGATGAGAATGGTTACTCTGCAGTGGTAGCTGACTTTGGCCTGGCTGAGAAGATCC 

CCGATGTCAGCATGGGGAGTGAGAAGCTGGCCGTGGTGGGTTCCCCATTCTGGATGGCAC 

CTGAGGTTCTCCGAGATGAGCCCTATAATGAAAAGGCAGATGTGTTCTCTTATGGTATCA 

TCCTCTGCGAGATCATCGCCCGCATCCAGGCCGATCCGGACTATCTTCCCCGCACAGAGA 

ATTTCGGGCTGGACTATGATGCTTTCCAGCACATGGTGGGAGACTGTCCCCCAGATTTTC 

TGCAACTTACTTTCAACTGCTGTAACATGGATCCCAAACTGCGCCCATCTTTTGTGGAGA 

TTGGGAAGACCCTGGAGGAAATTCTGAGCCGCCTACAGGAAGAAGAGCAGGAGAGGGATA 

GGAAGCTGCAGCCCACAGCCAGGGGACTCTTGGAGAAAGCACCTGGGGTGAAGCGACTAA 

GCTCACTGGATGACAAGATCCCCCACAAGTCACCATGCCCAAGACGTACCATCTGGCTGT 

CTCGAAGCCAGTCAGATATCTTTTCCCGTAAGCCCCCACGTACAGTGAGTGTCTTGGACC 

CATACTACCGGCCACGAGATGGTGCTGCCCGCACCCCCAAAGTCAACCCTTTTAGTGCTC 

GCCAGGACCTCATGGGGGGCAAGATCAAGTTTTTTGACCTGCCCAGCAAGTCTGTCATCT 

CTCTGGTATTTGACCTGGATGCACCAGGGCCCGGAACTATGCCCCTGGCTGACTGGCAGG 

AGCCCCTGGCCCCACCTATTCGCCGGTGGCGTTCCTTGCCTGGTTCGCCTGAGTTCTTGC 

ATCAAGAGGCTTGTCCATTTGTGGGCCGGGAAGAATCGCTATCTGATGGGCCCCCACCAC 

GCCTAAGTAGTCTCAAGTACAGAGTTAAAGAGATCCCACCATTCCGGGCATCTGCCCTAC 

CAGCTGCTCAAGCCCATGAGGCTATGGACTGCTCCATTCTCCAGGAAGAAAATGGTTTTG 

GGTCCAGGCCCCAGGGGACCAGTCCATGCCCTGCGGGTGCTTCTGAGGAGATGGAGGTAG 

AAGAAAGGCCAGCAGGCTCAACTCCAGCCACCTTCTCCACCTCAGGCATAGGCCTGCAAA 

CCCAGGGAAAGCAGGATGGGTGAGGGGGTTTAGTCCCTGCCTCACCTTGGGGATGGACCT 

TCAGCTGAAACCATATGGCCCCCTAGGTGCACAGCCTTGATTCTTCCCTGGAGCCTACAG 

AGCAGGCAGGCTAGGCCAAGCCAGGCTCAACTTCTGGGCTCCCAGTGCCCATTGGCTGTG 

TATGACGGGAGGCAGCAGTGAGAGGCCTTCCTAGTTAGGGCCAACAGCTGATACCAAGCC 

TCTGAAATCCAGCAAGGAGGTCTGCCTCCCACCAGACCCTCTCCAGTGTACTTCCCCAGA 

TAGGACCAGAGGATGTCTAGTTCTAGGCTGAGCTGGCAGGCAGCTATTACCCCGGTTCTT 

TCCCCACCCCAGGTCTGTCTCTTGCCTTTTCTTGGGGCATATAAGCTACTGAGTGGAACA 

TGGAGCTGATCAAGAGGCCGTAATGGTCATGGCTGTTTCCAGACCTGAATATTGGGTGCT 

TCTTGCCAGTATTCTAAGACATTTGAGTAATTGCTGTTTGCACTTACTGCATGGTCAGAC 

CACGTCACTACATTTCTATGCAAGGGGACAGCAAGGCAGCGTGGTGGTCATGGCTCTTAG 

CTAACCTATTCAAAGACCTTTTCCTGTTGATTAATCTATTTTCATATTTATAAAGGAGTC 

TTAATGTTCTGCCCCATAAGACTTTCAACCTTGTGGTTGGGAGTGGGGCTGGTTTTGTAG 

GCCCTAGGGCCTGCTTCTATGTATTTATCAACATGTGATACATTCAATTGGTTAAATGGT 

TTATACAGGGACTGATTTGCTTCCCTTCCTGCCATGGCTGGAGCTTTGGGAACAGTCTGT 

CCTTACAGAGCTGCAATAAGAAATAACCAAAGATGAAGCTGGTCAAATATTTTCATAACT 

TGCTTCTGTTGATTTTTTTTTTGTAAAACTTTCCCAAGACATTTTCAGACTTAAAAATAA 

AGTCAGTGTTACAGGT 
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Box I Observations where certain claims were found unsearchable (Continuation of hem 1 of first sheet) 

This Internationa! Search Report has not been established in respect ot certain claims under Article 17(2)(a) for the following reasons: 
1. ("J Claims Nos.: 

— because they relate to subject matter not required to be searched by this Authority, namely: 



Claims Nos.: 31-34 

©eSJsflrfeTNfefc? t$ M&S^fol Manatee fepiFr£^rI &A/d2K)i comply with me prescribed requirements to such 
an extent thai no meaningful International Search can be carried out. specifically: 



3. Claims Nos.: 

because tney are dependent claims and are not drafted in accordance with me second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This international Searching Authority found multiple inventions in this International application, as follows: 

see additional sheet 



1. I I As ail required additional search tees were timely paid by the applicant, this International Search Report covers all 
1 — 1 searchable claims. 



2. | | As an searchable claims could be searched without effort justifying an additional tee. this Authority did not invite payment 
of any additional tee. 



(completely), 23-38 (partially) 



4. | [ No required additional search tees were timely paid by the applicant. Consequently, this international Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest pT} The additional search tees were accompanied by the applicant's protest. 

j | No protest accompanied the payment ot additional search tees. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: Invention 1: Claims 1-14,26-38 (all partially) 

A nucleic acid molecule encoding a kinase polypeptide as 
represented by SEQ.ID.122 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEQ.ID.122 or a 
fragment thereof; an antibody or antibody fragment having 
specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEQ.ID.122. 



2. Claims: Inventions 2-78: Claims 1-20, 

23-38 (all partially and as far as applicable) 



A nucleic acid molecule encoding a kinase with a polypeptide 
sequence selected from SEQ. IDs . 123-199, wherein invention 2 
is limited to SEO.ID. 123, invention 3 is limited to SEQ. ID. 

124, , and invention 78 is limited to SEQ. ID. 199, or a 

domain thereof; a vector and a recombinant cell comprising 
said nucleic acid molecule; a nucleic acid probe for the 
detection of said nucleic acid molecule; a polypeptide as 
represented by a polypeptide sequence selected from 
SEQ. IDs. 123-199 or a fragment thereof; an antibody or 
antibody fragment having specific binding affinity to said 
polypeptide; a hybridoma which produces said antibody; 
methods for identifying a substance that modulates the 
kinase activity of said polypeptide; methods for treating a 
disease or disorder by administering such a substance; and 
methods for the detection of a kinase polypeptide 
represented by a polypeptide sequence selected from 
SEQ. IDs. 123-199. 



3. Claims: Invention 79: Claim 21 (completely) and Claims 
1-14,26-38 (all partially) 



A nucleic acid molecule encoding a kinase polypeptide as 
represented by SEQ. ID. 200 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEQ. ID. 200 or a 
fragment thereof; an antibody or antibody fragment having 
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specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEQ. 10.200. 



4. Claims: Invention 80: Claim 22 (completely) and Claims 
1-14,26-38 (all partially) 



A nucleic acid molecule encoding a kinase polypeptide as 
represented by SEQ. ID. 201 or a domain thereof; a vector and 
a recombinant cell comprising said nucleic acid molecule; a 
nucleic acid probe for the detection of said nucleic acid 
molecule; a polypeptide as represented by SEQ. ID. 201 or a 
fragment thereof; an antibody or antibody fragment having 
specific binding affinity to said polypeptide; a hybridoma 
which produces said antibody; methods for identifying a 
substance that modulates the kinase activity of said 
polypeptide; methods for treating a disease or disorder by 
administering such a substance; and methods for the 
detection of a kinase polypeptide as represented by 
SEQ. ID. 201. 



5. Claims: Inventions 81-121: Claims 1-20, 

23-38 (all partially and as far as applicable) 



A nucleic acid molecule encoding a kinase with a polypeptide 
sequence selected from SEQ. IDs. 202-242, wherein invention 81 
is limited to SEQ. ID. 202, invention 82 is limited to 

SEQ. ID. 203, , and invention 121 is limited to 

SEQ. ID. 242, or a domain thereof; a vector and a recombinant 
cell comprising said nucleic acid molecule; a nucleic acid 
probe for the detection of said nucleic acid molecule; a 
polypeptide as represented by a polypeptide sequence 
selected from SEQ. IDs .202-242 or a fragment thereof; an 
antibody or antibody fragment having specific binding 
affinity to said polypeptide; a hybridoma which produces 
said antibody; methods for identifying a substance that 
modulates the kinase activity of said polypeptide; methods 
for treating a disease or disorder by administering such a 
substance; and methods for the detection of a kinase 
polypeptide represented by a polypeptide sequence selected 
from SEQ. IDs. 202-242. 



6. Claims: Inventions 122-136: Claims 15-20, 

23-25 (all partially) and claims 1-14, 
26-38 (if applicable) 
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A nucleic acid molecule encoding a kinase polypeptide as 
represented by a 'gene name 1 selected from 'AA980090', 
'AA045601', 'AA297313', 'N23936', '5R72-18-1', '5R79-54-r, 
'5R65-16-1', 'AA065538', 'H17727' , 'W08549', 'AA430250', 
'AA139478', 'R87679', 'W65887', 'AA948538' , '5R69-23-3', and 
'5R69-26-2', wherein invention 122 is limited to 'AA980090', 

invention 123 is limited to 'AA04560r, and invention 

136 is limited to , 5R69-26-2 , 5 or a domain thereof; a vector 
and a recombinant cell comprising said nucleic acid 
molecule; a nucleic acid probe for the detection of said 
nucleic acid molecule; a polypeptide encoded by said nucleic 
acid molecule or a fragment thereof; an antibody or antibody 
fragment having specific binding affinity to said 
polypeptide; a hybridoma which produces said antibody; 
methods for identifying a substance that modulates the 
kinase activity of said polypeptide; methods for treating a 
disease or disorder by administering such a substance; and 
methods for the detection of a said polypeptide. 
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Claims Nos. : 31-34 



The search was based on the sequence listing furnished in computer 
readable form, the numbering of which differs from the numbering in the 
figures. 

Claims 31-34 refer to a ' substance that modulates the activity of a 
kinase' without giving a true technical characterization. Moreover, no 
such specific compounds are defined in the application. In consequence, 
the scope of said claims is ambiguous and vague, and their subject-matter 
is not sufficiently disclosed and supported (Art. 5 and 6 PCT). 
No search can be carried out for such purely speculative claims whose 
wording is, in fact, a mere recitation of the results to be achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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